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APPROACHES TO UNIVOCAL FACTOR SCORES 


J. P. GUILFORD 
AND 
WILLIAM B. MICHAEL 


UNIVERSITY OF SOUTHERN CALIFORNIA 
PRINCETON UNIVERSITY AND THE COLLEGE ENTRANCE EXAMINATION BOARD 


Two current methods of deriving common-factor scores from 
tests are briefly examined and rejected. One of these estimates a 
score from a multiple-regression equation with as many terms as 
there are tests in the battery. The other limits the equation to a 
few tests heavily saturated with the desired factor, with or without 
tests used to suppress the undesired factors. In the proposed meth- 
ods, the single best test for each common factor is the starting 
point. Such a test ordinarily has a very few undesired factors to be 
suppressed, frequently only one. The suppression test should be uni- 
vocal, or nearly so. Fortunately, there are relatively univocal tests 
for factors that commonly require suppression. Equations are of- 
fered by which the desired-factor test and a single suppression test 
can be weighted in order to achieve one or more objectives. Among 
the objectives are (1) maximizing the desired factor variance, (2) 
minimizing the undesired factor variance, (3) a compromise, in 
which the undesired variance is materially reduced without loss in 
desired variance, and (4) a change to any selected ratio of desired 
to undesired variance. A more generalized solution is also suggest- 
ed. The methods can be extended in part to the suppression of more 
than one factor. Equations are derived for the suppression of two 


factors. 


The present problem and its solution* arose in connection with 
the recent construction of a battery of aptitude tests (2) each of 
which was intended to provide a unique score for one common fac- 
tor, or in Thurstone’s terminology, one primary ability (3). In the 
construction of the battery and in the development of the solutions 
here proposed, several general assumptions were made: 


1. That the nature of human aptitudes is best described by 
means of group factors or reference variables. 

2. That an orthogonal system of these reference variables will 
suffice for most purposes. 

3. That in the practical uses of tests for personnel selection and 
classification, and particularly for vocational guidance, the most eco- 
nomical, convenient, and meaningful approach is through the use of 
univocal factor scores. 


* This paper is based upon a report read by the senior author at a joint 
meeting of the Western Psychological Association and the Institute of Mathe- 
matical Statistics at San Diego, California, June 19, 1947. 
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A univocal factor score is defined as one having variance in only 
one common factor, its remaining variance being attributable to er- 
rors of measurement plus possible specific variance. When the term 
“factor” is used in what follows, a common factor will be meant. 

The attempt to achieve univocal factor scores encounters the dif- 
ficulty that up to the present time, at least, it has been impossible to 
construct factorially pure tests except for a very few factors. Most 
tests have appreciable, if not substantial, variances in two or more 
common factors. To that extent, each raw test score is ambiguous in 
meaning as well as in predictive value. To meet this situation, two 
solutions have previously been suggested. 

One type of solution, which is mathematically appealing but prac- 
tically forbidding, is to derive a multiple-regression equation predict- 
ing individual differences in the desired factor from a. weighted com- 
bination of all tests in a battery. If batteries are large, as vocational 
selection and guidance require for effective results, this procedure is 
intolerably cumbersome. The entire battery would have to be admin- 
istered in order to measure any one factor, and the computation of 
scores would be a large clerical task. Furthermore, the resultant fac- 
tor composition of the composite score would vary from battery to 
battery and would probably not be entirely univocal in any case. 

A second type of solution, which is more economical than the first, 
has some of the same objections and an additional weakness. This 
solution involves a weighted combination of a very few tests most 
heavily loaded in the same factor. As in the first solution, this com- 
posite score will be likely to contain additional factor variances. The 
new weakness is that whereas in the first solution the superfluous 
factor variances may be partially or completely suppressed by the in- 
clusion of tests that do not measure the desired factor but do meas- 
ure to a much higher degree the secondary factor variances, in the 
second solution the suppression variables are lacking unless inten- 
tionally selected and included. Unless suppression tests are used, the 
intercorrelations of the factor scores are larger than the intercorre- 
lations of the factors, because of overlap in extraneous factor vari- 
ances. 

A third type of solution, which is to be proposed, overcomes most 
of the disadvantages of the first two. The principle of it is, first, to 
devise or to select the one best test for each factor—best in terms of 
maximal loading in the desired factor and minimal loadings in addi- 
tional, undesired factors—and second, to use good suppression tests 
by means of which the unwanted variances can be eliminated or mini- 
mized without intolerable loss in the variance we wish to measure. 

The situation seems very favorable for the third type of solution, 
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for several reasons. While there are very few factors for which there 
are univocal scores without suppression of undesired variances, there 
are many tests in which there is significant variance in only one factor 
in addition to the one we wish to measure. The suppression of a single 
factor in a test is a relatively simple matter. Fortunately, the factors 
that are quite frequently found as secondary in tests are precisely 
those for which there are already relatively univocal scores. Those 
factors are verbal ability, numerical ability, and perceptual speed. For 
many a factor, then, the problem is solved by using a weighted com- 
bination of two tests. This case is so common that it will pay us to 
reduce the weighting procedure that is required by it to a simple, 
routine form. In doing so, there are a number of considerations that 
do not meet the eye until one gets into the problem further. Several 
different objectives have to be considered, and when the objectives 
are incompatible decisions have to be made as to which ones are more 
important. It is the main purpose of this paper to consider those ob- 
jectives and means of achieving them. 


The Suppression of a Single Factor 
First, a number of symbols must be defined: 


Let X,,= a score in test M whose common factor D we desire to 
measure; 
X,= a score in a suppression test S that is relatively unique in 
the undesired factor, U; 
X,=a score designed to measure univocally factor D, derived 
from any weighted combination of X,, and X,; 
om — the standard deviation of scores in test M; 
o, — the standard deviation of scores in test S; 
Tam = the loading of factor D in test M; 
Yas — the loading of factor D in test S; 
Tum = the loading of factor U in test M; 
Tue — the loading of factor U in test S; and 
T'me — the correlation between tests M and S. 


The weights to use for scores in tests M and S will depend upon 
what objective one wants to achieve. There are several choices of 
objectives. There is a possibility of increasing the variance in the 
desired factor while decreasing somewhat the loading in the unde- 
sired factor. There is a possibility of decreasing the undesired vari- 
ance to zero, but in so doing there is a danger that the desired vari- 
ance will be lowered more than can be tolerated. In view of these pos- 





es 
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sibilities, let us consider four objectives and their attainment. In all 
four solutions, for the sake of practical convenience it is desirable to 
require that the weight of the score in test M shall be +1. The weight 
for test S will then take on some value that is negative and probably 
less than 1. This will be called the suppression weight. The practi- 
cal scoring formula will then be of the form: the score in test M 
minus a fraction of the score in test S. Analytical derivations of the 
weights will be found in the Appendix to this article. In what follows 
here, we shall merely report the end results and their implications. 


Case 1. Objective: To maximize the variance of the desired factor in 
the composite score X,. 


Here we want an equation of the form X, = Xm + w,X;, in which 
w, is the weight to be applied to a score in test S when the weight 
for test M is +1. The relative weights for X,, and X, that will satisfy 
this objective will be those consistent with the familiar multiple-re- 
gression equation which maximizes the correlation between one vari- 
able (here factor D) and a weighted sum of two other variables (tests 
M and S). The following solution for w, is found: 


i (Tas = Tan¥ an) om 


( Yan — Tins) Os 





Ws (1) 

This formula has been: applied to an arithmetical-reasoning 
test. It was assumed that this test had loadings of .5 in each of two 
leading factors—general reasoning and number ability.* Suppose that 
we want to suppress the number variance by combining with this 
test one of numerical operations, which is pure for the number factor, 
with an estimated loading of .7. With o,, = 14.9 and o, = 19.6, the 
weight, w, , which will maximize the loading in general reasoning, by 
equation (1), is —.30. This weight yields a multiple correlation (that 
is, the loading of the general reasoning factor in the composite) of 
.55, which represents a small gain of .05. What we are most inter- 
ested in is the loss of number-factor variance which should follow 
from the use of a negative weight for the number test. Following 
application of the general equation for the correlation of sums, the 
number-factor loading is estimated to be .24 in the composite score. 
The reduction of loading in number variance from .50 to .24 is con- 
siderable. It is even more appreciable when put in terms of propor- 
tions of variance. The drop in number-factor variance is from 25 per 
cent to about 6 per cent. There is still more of it remaining than we 
would like; and if the magnitude of the negative weight is increased 


* The test data used in this article were chosen from the AAF research re- 
ports (1). 
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for the suppression test, the number variance can be reduced still 
more without appreciable loss in reasoning variance, as will be shown 
later. 

Some special variations of equation (1) are of interest. If the 
suppression test contains no variance in the desired factor, i.e., if 
Tas — 0, the weight becomes 

¢. 
10, =— Fae ( oe (2) 


Os 


When, in addition, the scores in the two tests have been scaled to the 
same dispersions, i.e, when 7a, = 0 and om = o; , the weight becomes 


W,—=——T ms - (3) 


In other words, under these special conditions, the suppression weight 
is numerically equal to the correlation between the two tests, with 
changed sign. If the only variance that tests M and S have in com- 
mon is in factor U, since 7m, then equals 7um7us , the numerical value 
of the weight for the suppression variable is the product of the two 
factor loadings of the undesired factor in the two tests. This is the 
weight, under these special conditions, that maximizes the desired 
variance in the weighted composite. 


Case 2. Objective: To minimize the variance of the undesired factor 
in the composite score X,, i.e., to reduce it to zero. 


It was seen that the solution under case 1 is likely to leave some 
undesired variance remaining in the composite. We now consider the 
effect of requiring complete elimination of the undesired factor. 

In solving for the suppression weight to meet this objective, we 
need merely assume that the correlation of factor U with the com- 
posite X, equals zero. Solving the equation for the correlation of 
sums when applied to this situation, we find that 


TumOm 


(4) 





WwW, =— 
TusOs 

Applying this formula to the arithmetic-reasoning test, we find 
that a weight of —.54 for the numerical-operations test would com- 
pletely suppress the number variance. By the correlation of sums, 
we find that the loading in the desired factor then becomes .52. Thus, 
in suppressing the number variance, which was indicated by a load- 
ing of .5 in the arithmetic-reasoning test alone, to a loading of zero in 
the composite, we have increased the loading in general reasoning 
from .50 to .52. This is .03 below the maximum possible loading for 
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general reasoning when these two tests are combined, but the cost is 
not too much to pay for the riddance of the remaining 6 per cent of 
number variance. 

One special variation of formula (4) is of interest, and that is 
for the case when °,, = o,. Then, 


w,=——. (5) 
Tus 

The suppression weight thus becomes the ratio of the undesired fac- 
tor loading in test M to its loading in test S. Compared with equa- 
tion (3), it is interesting to see that for maximizing the desired factor 
variance in the compesite the suppression weight is proportional to 
the product of the two loadings in the undesired factor, whereas for 
minimizing the undesired variance the weight is proportional to their 
ratio. 

The use of equation (4) does not always result in an increase in 
the loading in the desired factor. In fact, it can lead to a material 
decrease, as another application will show. A test called Mechanical 
Principles, which is similar to the Bennett mechanical-comprehension 
test, was found to have its chief variances in two factors, mechanical 
experience, with a loading of approximately .6, and visualization, with 
a loading of approximately .5. It is possible to measure the mechani- 
cal-experience factor in a unique test of a different kind with sub- 
stantially higher loading; hence, there is no practical reason in this 
instance for wanting to suppress the visualization loading. Suppose, 
since there is a relatively pure mechanical-experience test, that we 
attempt to suppress the mechanical variance in the Mechanical Prin- 
ciples test in order to achieve a univocal score for visualization. By 
equation (4), it was found that the suppression weight should be —.89 
(e,. = 86, o, = 83, fa. = .1, Tan = B, Tue = 7, and Ta, = 5). But 
while this weight would reduce the mechanical variance to zero, at 
the same time it would lower the loading in visualization from the 
original .50 to .44. This is an intolerable loss in measurement of the 
desired factor. There must be a smaller suppression weight that 
would lower the undesired factor loading to a negligible degree with- 
out entailing intolerable loss from the desired factor loading. This is 
a problem of cost and utility. We have two rational solutions to offer, 
which are described under cases 3 and 4. 

Case 3. Objective: To reduce the variance in the undesired factor to 


the point at which the desired variance in the composite is no 
lower than that in test M. 





This calls for no lowering of the desired factor variance when 
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we deduct a weighted score in the suppression variable. The solution 
is achieved by setting the equation for the correlation of sums equal 
to tam. There are two solutions for w, , one of which is when w, = 0, 
that is, when there is no deduction at all. The other solution gives 


2 (Tan? ns a TamV as) om 
(San* ae Tas”) Os 


When this equation is applied to the problem of the Mechanical 
Principles test, a weight of —.65 is required for the suppression test. 
This lowers the loading of the mechanical factor, which was originally 
.6, to +.18, and leaves the loading for visualization at .5. This might 
be regarded as satisfactory, unless it can be shown that the remain- 
ing mechanical variance (which is only 3 per cent of the total vari- 
ance) can be reduced materially at a cost, let us say, of not more 
than .01 of the desired factor loading. 

If the suppression test contains none of the desired factor vari- 
ance, i.e., if 714, = 0, equation (6) becomes 


(6) 





om 
Yo — 20 ms ( ae ) I (7) 
Gs 
If, in addition, op = oa; ; 
Ws =—_— 2h ne . (8) 


In comparing this equation with (3), it is interesting to note that the 
suppression weight required to minimize the undesired variance to 
the point where the desired variance is equal to that in test M is 
just double the weight required to maximize the desired variance, with 
some loss in undesired variance. The two weights, one equal to —1'ms 
and the other to —2rn;, are interesting landmarks in the solution to 
the suppression problem, when the suppression test contains no de- 
sired variance and when the two tests have equal total variances. Even 
when these two conditions are only approximately satisfied, the same 
may be true within practical tolerance limits. 


Case 4. Objective: To assure that the desired variance in the com- 
posite will bear a certain ratio to the undesired variance. 

Let the ratio of 71.2 to ruc? be K and assume that the weight for 
X» equals +1, as before. Stating the two correlations ra, and Tuc as 
correlations of sums, squaring these expressions, and solving for the 
weights, we find the formula 


a (amTas ras. Kun us) + (1 usTam ar TasTum) VK om 


(9) 





W, = 


(Tas? — Krys") os 


IEE OS SARS OS 


Sonya 











i 
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If ra = 0, 
—— Rie. + Tam VK Om 
Ww, = —, (10) 
Kr.. Cs 
If Vas = 0 and om — Cs , 
Tum Tam 
vw,=—-— = on, (11) 
Tus TusVK 


Now it can be seen that the first term at the right in (11) is identical 
with the suppression weight required to suppress the undesired fac- 
tor variance to zero and the second term is an increment which is a 
function of K. The two values that satisfy any selected K ratio (un- 
der the conditions of equation 11) lie at equal absolute distances be- 
low and above that value of w, at which 7, is zero. One of these 
values would result in a small positive loading for the undesired fac- 
tor and the other would yield a small negative loading. Since we would 
probably not want a negative loading in any factor, the former of 
these two solutions would be preferred. 

Applying equation (10) to the Arithmetic Reasoning test with 
K = 100, we find that the two solutions for w, are —.49 and —.60. 
We previously found that a weight of —.54 for the numerical test 
would suppress the undesired variance to zero. A weight greater than 
that, such as —.60, would lead to an over suppression and a negative 
value for 7... We therefore reject the second of these two solutions 
and accept the first. This would leave some undesired variance, but 
it would be only 1/100 of the desired variance, whereas originally the 
two variances were equal. Since we know from previous results that 
the desired variance is approximately 25 per cent of the total, the 
undesired variance must be close to 1/4 of one per cent. This would 
mean a loading, 7, , of about .05. The loading could be more exactly 
computed as was done previously by the correlation-of-sums formula. 

In applying case 4 to the Mechanical Principles test problem, we 
must use equation (9), since 7,, ~ 0.0. Again, let us require a K ratio 
of 100. The solution gives suppression weights of —.78 and —.93. 
Since we know from an earlier solution by case 2 that a weight of 
—.89 reduces 7, to zero, the first of these two values is preferred. When 
w, was —.89, we found an intolerable loss in the desired variance. 
When w, is —.78, the desired factor loading drops to .47. This is prob- 
ably too great a loss when we consider the fact that the undesired fac- 
tor would have a loading of only .047. A larger value of 7, could 
well be accepted. By choosing other K ratios, for example one of 25, 
and by checking upon the resulting values of ra and Tuc, we would 
arrive at an acceptable suppression formula. 
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In spite of shrewd choices of K in the application of case 4, there 
may be too much trial and error to suit the average investigator. It 
may involve little additional work to follow the procedure next to be 
described, and the satisfaction with the results will often justify the 
price. We will now discuss the general case of the suppression of one 
factor. The cases already mentioned are only four among many. They 
happen to be of particular interest, and they satisfy rational objec- 
tives. We now ask what happens to the loadings in both the desired 
and undesired factors as the suppression weight for test S varies con- 
tinuously from zero to —1.00. Based upon the equation for the corre- 
lation of sums, two functional relationships can be derived to fit any 
particular situation. The general formula for computing 74, is given 
in equation (18) in the Appendix. To simplify computations, let us 
assume that o,, = o¢, = 1.00, in which instance 


Yam + WY as (12) 





Ydc —— 





Wi + a + QW. ne 


By analogy, the undesired factor loading in the composite is given by 
the equation 
Tum + WY us 


; (13) 
V1 + w? + 2w.%ns 





Tuc = 





Let us assume the special case to fit the Arithmetic Reasoning 
test, in which fam = .5, Pum = .5, Tas = 0.0, Tus = .7, aNd Tms = .85. The 
working equations then become 


5 
yYivte + 78, 





Vac = 





and 
is wae 73 


Vit we+.7, 


If we assign in turn arbitrary values to w, , the corresponding values 
of Ta and 7u< can be computed. 

To illustrate what happens in a more general way, we have let 
w, vary from zero to —1.0 by steps of 0.1, for sets of values when fing 
was allowed to vary by steps of 0.1 from a minimum to .6. The mini- 
mum 7;,; occurs when the only factor common to M and S is factor U , 
assuming a positive manifold. We have also chosen cases in which 
Tus Varies in steps of 0.1 from .5 to .8. 

Table 1 provides resulting factor loadings for D and U in the 
composite score for each set of conditions. Fig. 1 shows graphically 





7 uc 
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TABLE 1 

Loadings of Desired Factor D and of Undesired Factor U in a Weighted 
Combination of Two Tests, M and S (r,,, %,-) of Different Degrees of Inter- 
correlation (7,,,) When the Weight of Test M is +1.00 and the Weight (w,) of 
the Suppression Test S is Varied, When Loadings of Desired and Undesired Fac- 
tors in Test M (Tan, Tym) are .50, When Loadings of Factor U in Test S (r,,) 
Vary, When the Loading of Factor D in Test S (1,,) is Zero, and When Stand- 
ard Scores Are Used in Both Tests* 








1, 25 30 35 40 50 60 





We Tae Tue Tac Tuc ae Tue de Tuc ae Tue ae Tuc 


-—1 510 459 513 462 516 464 518 467 524 472 530 477 
-—2 516 413 521 417 527 422 533 426 546 436 559 425 
-—3 516 361 524+ 367 583 373 542 380 563 394 585 410 
Tyg —-5 -4 510 306 521 313 583 320 546+ 327 574 344 606 364 
-—5 500 250 513 256 527 264 542 271 577+ 289 620 310 
-6 485 194 500 200 516 206 583 213 574 229 625} 250 
-7 468 140 484 145 500 150 518 156 563 169 620 186 
-8 449 090 464 093 481 096 500 100 546 109 606 121 
-9 429 043 444 044 460 046 479 048 524 052 585 059 
-10 408 000 423 000 439 000 457 000 500 000 559 000 


-1 518 451 516 454 518 456 524 461 530 466 

-.2 521 396 527 401 533 405 546 415 559 425 

-.3 5247 335 533 341 542 347 563 360 585 375 

Ty,— 6 —A4 521 271 5383 277 546+ 284 574 298 606 315 
-.5 5138 205 527 211 542 217 577+ 231 620 248 

-.6 500 140 516 144 583 149 574 161 625} 175 

—.7 484 O77 500 080 518 083 563 090 620 099 

-.8 464 018 481 019 500 020 546 022 606 024 

-9 444 -035 460-037 479 -038 524 -042 585 -047 

-10 423 -085 439 -088 457 -091 500-100 559 -112 

-1 516 444 518 446 524 451 530 456 

—.2 527 3879 5383 384 546 393 559 403 

-.3 5383 309 542 315 563 326 585 339 

-A4 5383 235 5467 240 574 252 606 267 

T,,= 1 -5 527 158 542 162 577+ 173 620 186 
-.6 516 083 533 085 574 092 625+ 100 

-.7 500 010 518 010 563 011 620 012 

-8 481 -058 500 -060 546 -065 606 -073 

-9 460 -120 479 -125 524-136 585 -152 

-1.0 439 -175 457 -182 500 -200 559 -224 

-.1 518 485 524 440 5380 445 

-.2 5338 362 546 371 559 380 

-3 542 282 563 293 585 304 

T.,— 8 -4 5464 196 574 206 606 218 
-.5 ‘ 542 108 5777 115 620 124 

-.6 5383 021 574 023 625+ 025 

-.7 518 -062 563 -068 620 -074 

-8 500 -140 546-153 606 -170 

-9 479 -211 524 -231 585 -257 

-1.0 457 -274 500 -300 559 -335 








* All values for fac and ‘uc are multiplied by 1000 in order to eliminate decimal points. 
+ Maximum possible value for tac for the given Tus and Tms (Ws = -Tms). 
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FIGURE 1 
Regressions of the Desired-Factor Loading (7,,) and the Undesired-Factor 
Loading (7,,.) in the Composite Score as the Suppression Weight Varies from 
Zero to -1.0 


the regressions of the two factor loadings, 7a. and 7u-, upon ws, when 
ys = .7. Each regression curve is for a constant value of 7m; . 

From the regression curves as well as from the table, it will be 
seen how the formulas for cases 1 to 3 are satisfied. In each regres- 
sion, the maximum rz. comes when w, = — 7'ms. The desired factor 
loading, 71, equals 7am When w, = — 21m. The loading of the unde- 


sired factor, 7u-, equals zero when Ws = —um/Yus = —-714. In this 
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connection, it will be noticed that the two regression curves for Tuc 
intersect when r,, = 0.0 and when w, = —.714. This checks with the 
fact that w, is independent of 7, , and regardless of the value of 1ns 
it equals the ratio —Tum/Tus . 

Some general conclusions may be drawn from the regression 
curves. When 7,,, ~ 0.0, the value of 7.. first rises as w, increases 
negatively and then falls after a maximum 7,;, is reached at ws —=—T ms - 
For different values of 7,,,, there is relatively little change in 7. for 
wide variations of w, , within the range represented. The value of fuc 
drops al: ost linearly and quite rapidly, however, as w, increases nega- 
tively. This is a very fortunate circumstance. Also to be noted is the 
fact that the regression of 7, upon w, is not very sensitive to the size 
Of 7ms, Whereas the regression of 7;- upon w, is notably sensitive to 
that correlation. In this connection it would appear that a large Tine 
is more favorable than a small one. It leads to a greater rise in de- 
sired variance per unit increase in suppression weight and allows for 
a higher maximum 7,.. On the other hand, the smaller the size of 
Tm, the more precipitous the loss in undesired variance to the point 
at which 7, = 0.0. This is a very minor consideration, however, in 
view of the very slight effect of 7,,, upon the slope of the 7, regres- 
sion curve. 

A pair of regression curves, one for 7. and one for 7,-, could be 
drawn to fit any test situation. Inspection of the regressions would 
show what effect any particular suppression weight would have upon 
the two factor loadings. Decision as to the best weight could be 
reached when the various costs and benefits are taken into account. 
This procedure is recommended when results obtained under the con- 
ditions of cases 1 through 4 are unsatisfactory. 


The Suppression of Two Factors 
The methods of suppression of a single factor may be extended 
in some respects to the suppression of two undesired factors. The 
same principle is recommended, namely, to use a single test to repre- 
sent the desired factor and a single test, preferably unique in the un- 
desired factor, to suppress each secondary factor. The discussion will 
parallel that preceding as far as possible. 


Case 1. Objective: To maximize the variance of the desired factor 
in the composite score. 
For this purpose, an equation of the type X,. = X, + w.X, + w3X3 
is desired, in which 


X.is the composite score approaching uniqueness in a desired 
factor, factor O; 
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X, is the score in a test that is strong in factor O; 

X, and X; are scores in two suppression tests, the factors N and 
P of which are to be suppressed ; and 

W, and w; are the suppression weights for X, and X; , respective- 
ly, when the weight applied to X, is equal to +1.0. 

The solution to this problem gives the following formulas: 


(112 — 11323) 1 








W2 = — 14 
: (1 — 1e:?) on sas 
and 
133 — Ti21%2 1 
w;=— (Tus ee : (15) 
(1 — 137) o3 


It will be seen that, ignoring the standard deviations in (14) and 
(15), the chief component of the right-hand term is equivalent to a 
beta coefficient (with negative sign) that is obtained in a three-vari- 
able multiple-regression equation. With the standard deviations in- 
cluded, each right-hand term is equivalent to a b-coefficient, or par- 
tial regression weight (with negative sign). The result is analogous 
to that reported in equations (2) and (3). 

As an example of the kind of results that may be expected from 
this approach, let us assume that the Arithmetic Reasoning test has 
a loading of .2 in the perceptual-speed factor in addition to its load- 
ing of .5 in reasoning and .5 in numerical facility. Assume a loading 
of .8 for the numerical factor in a Numerical Operations test and a 
loading of .7 for the perceptual-speed factor in another suppression 
test which also contains a loading of .1 for the number factor. By 
formulas (14) and (15) the suppression weights are —.42 for the 
number test and —.19 for the perceptual test. The consequent load- 
ings in the composite score are as follows: reasoning, .57; number, 
.16; and perceptual speed, .08. 


Case 2. Objective: To minimize the undesired factor variances. 


In this approach, we want both 7., and 7.» to be zero, where 1cn 
and 7.» are loadings in the two undesired factors in the composite 
score. The weights that will accomplish this are: 








(Trl ps — Vm nz) & 
’ w.=— ni! p3 pi! n3) O01 (16) 
(Tne? ps aaa T nz) 02 
and 
(Trt — etm) o 
Ws = n pl ni 1 (17) 


? 
(Tn2% ps = Tens) 03 


under the special conditions that the weight for X, = +1.0 and that 
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nr 
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neither of the suppression tests contains variance in the desired fac- 
tor, i.e., Tao = Taz — 0.0. 

In order that the conditions of the problem just previously used 
as an illustration for case I may be satisfied, the two weights must 
be —.59 and —.29 for number and perceptual tests, respectively. These 
weights, while reducing 7., and 7, to zero, leave 7o-, the desired fac- 
tor loading in the composite, at .56. The complete riddance of the resid- 
ual amounts of the two factors after application of case 1 seems well 
worth the cost of a drop of only .01 in the desired factor loading. 

The requirements of cases 3 and 4, as stated when applied to the 
suppression of one factor, lead to derivations that would entail a pro- 
hibitive amount of labor when applied to the suppression of two fac- 
tors. Solutions for these cases were therefore not attempted for the 
three-variable problem. Cases 1 and 2, with certain restrictions elab- 
orated upon in the Appendix, can be extended to apply to any number 
of test variables. It is believed, however, that it is neither practical 
nor necessary to go beyond the three-variable problem. Where neces- 
sary, effort should be made, instead, to reduce the number of common 
factors in tests to only one or two by better test construction. 


APPENDIX: DERIVATIONS OF FORMULAS 


A. Predicting a Factor Score From Two Tests With One 
Acting as Suppressor 
CASE I. To maximize the loading of the desired factor D in the com- 
posite score X,.. 


If two tests, M and S, are used in combination, the composite 
factor score may be expressed by the equation 


Xe=WnXen + Ws, 


in which X,, and X, represent, respectively, scores in test M and test 
S and w,, and w, stand for the weights to be applied to the respective 
test scores. As mentioned previously, test M is factorially complex 
with substantial loadings in two factors, D and U, the desired and 
undesired factors, respectively, from the standpoint of predictive 
preference. The suppressor test is relatively pure with respect to fac- 
tor U , although there may be a small loading in factor D. 

The correlation of factor D with the weighted sum of scores in 
the two tests is given by the following equation 


WmOml am + W303 as 
tic ’ ( 18) 








V Wn? 0m? + W,;70,7 + 2ZWimWsOmOsT ms 
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where X. = WmXm + WsX¢. 

In the right-hand member of the equation the o’s and the 7’s are 
arbitrary constants, and wm and w, are the independent variables; 
Tac is, of course, the dependent variable. In the present case, it is the 
objective to maximize Tac . 

A necessary, but not sufficient, condition for a maximum (or 
minimum) of a continuous, single-valued function 


U= f(Xr5°9+,Xi,e++, Xn), 
the first-order partial derivatives of which are continuous, is that 


om en ee A 
OX, > 5] > e 





OX; , OXn 


Provided that Wom? + W,705" a 2WmWeOmOs' ms = 0 8 Tac will be 
a maximum when 


Or ac or, 
“=0 and —=0. 
OWm Ow; 








The two first-order partial derivatives of (18) are as follows: 


OT ac x Ws0m0s [ws Cs (Tam = TmsT as) + Wmom (TamTms .. Tas) J 


(19) 
OWm (W20m? + W,70,7 + 2WmWeomos1ms) aie , 








Or ae _ WmOmes [wees (Tas? ms at Tam) + Wnom (Tas a Tam?ms) ] 


(20) 
Ow, (Wr20m? + W204? + 2ZWmWeomost ms) */” 








When each of the partial derivatives is set equal to zero, the fol- 
lowing two identical linear homogeneous equations result: 


Gs (Tam — TmsVas) Ws + om (Tanne — Tas) Wm = 0 (21) 
Os (Tam ees Ts" as) Ws + on (Tanase aa Tas) Wm =0 (22) 


provided that wm #0, ws #0, om #0, 0,40, and Wm2on? + W705" + 
2WmWsomos'ms > 0. This outcome is not surprising inasmuch as a sec- 
ond restricting function involving w, and w, was not present prior to 
differentiation. Inspection of (18) will reveal that w,, and w, can be 
assigned an infinite number of values without changing the value of 
Tac provided that the ratio of w, to wm is always constant. Hence, if 
kw,, and kw, are substituted into (18) for wm and w,, respectively, 
where |k| # 0, the factor k in the numerator will cancel with the 
factor k in the denominator. 

Since the rank of the matrix of the coefficients correspond- 
ing to the two unknowns in the two equations (21) and (22) is unity, 
one of the unknowns may be assigned an arbitrary value, and the 
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second unknown may be solved in terms of the first provided that the 
coefficient of either unknown is not zero. It may be said, in effect, 
that one degree of freedom is present among the two weights. 

If the weight of the suppressor test is desired, it is easily ob- 
tained from (21) and (22) as: 


Ma (Tas — Taml ms ) om 


(Tum — Tas'ms ) Os 


Wm. (23) 





Ws 


For convenience, w,, may be assigned a value of unity. If, more- 
over, 7a, vanishes, then equation (23) simplifies to 
TmsOm 


w,=—— : (24) 


Gs 





If c» is equal to o,, or if data are expressed in standard scores, 
then equation (24) reduces to 


Ws =—Tms + (25) 


If the assumptions of (25) are true in a given situation, the sub- 
stitution of —~,,, for w., of +1.00 for wm, and of 1.00 for om and o; 
(or of o, for o, , and vice versa) results in a simplified equation for 
(18): 


Tam 
Tac = ——————.. (26) 
V 1 nie: Vas 
The equation for 7,- , the correlation of factor U with the weight- 
ed sum of the scores of tests M and S, upon fulfillment of the same 
assumptions as in (25) and upon the appropriate substitutions in 
(28), simplifies to: 


Tum — TmsY us 
Tue = —————_-. (27) 
V1— me 


CASE II. To reduce the loading of the undesired factor U in the com- 
posite to zero. 
If ruc is equal to zero, the numerator of the right-hand member 
of the equation 
WmOm" um + Ws us 


Tuc = (28) 
V Wn? om? + W047 + 2WmW Omoslms 








must equal zero. That is, provided that 


Wargn? + W,20,7 + 2WaW Cat sTas > 0, 
(29) 


WmOmum + W061 us = 0 . 
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The weight of the suppressor test upon solution of (29) for w, is: 


Tumom 
wv, Wm. (30) 
TusOs 


If wm is unity and if om is equal to «, , equation (30) simplifies to: 





Tum 


w,=— ; (31) 
Tus 





CASE III. To reduce the loading in the undesired factor, 7,-, to the 
point at which ra is equal to ram . 


The problem is to ascertain the value of w, such that 


Wimoml am + Wstsl as 





Tam 


(32) 





V Wn on? + W470" i 2WmWsOm0sl ms 

After the squaring of both sides of equation (32) and transpos- 

ing of terms, the following expression results: 
os” (Tan? Pr Tas”) W,? + 2omes (Team? a Tam as) Wn; = 0. 
The two solutions for this quadratic equation in w, are: 
w,=0 
and 
—2. (TmeTam? — Tamas) Om 
_— (7's am amTas) 0: 7 (38) 


(Tam? — Tas”) os 


If w, is equal to zero, the suppressor test obviously is eliminated 
from the composite. In this instance rz. is merely equivalent to Tam . 
If ra, is zero and if w,, is unity, then (33) reduces to: 





om 
Ws = —21 mg —.- (34) 


Gs 


Finally, if the two standard deviations are equal (34) then be- 
comes: 


Ws = —2r ms - (35) 


CASE IV. To make the ratio of the desired factor variance 72,, to the 
undesired factor variance 7°, a given constant K. 


The ratio of ra? to ru-2, which is to equal a given constant 
K (K 2 0), is given by: 
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(WmomTam + W051 as) , 





Tac? Wn? omn? + W705" + 2ZWmWsOmOsT ms 











=K, (36) 
Tuc? (WinomT um sa Ws us)? 
Wm? Om? +: W,2047 + LW WeomosT ms 
where 
fiers, 
and 


Wom? + W,20,7 + 2WmWeomes'ms > 0. 


Upon algebraic simplification of the two expressions to the right 
of the equal signs, a quadratic equation in w, and w,, results: 


o,? (74," — Kus?) Ww,” > Zomes (TamTas Soe BuaNews) WnWs 


3 
+ On? (Pan? — Kun?) =0. (37) 


When w, is solved in terms of w,, through use of the familiar 
quadratic formula, the following formula is obtained: 


—— (Tam? as nie Rt se) = (%usTan — TaT und VK om 
w, = — Wm. (38) 


(t4,* a Kus?) Cs 


If ras is equal to zero and if w,, is unity, equation (38) becomes: 





| a + Talay & 
W, = , (39) 
sino Kr .,? 





or, more simply, 
as |< am — Tim K 
Kru; ; 





(39a) 


An interesting corollary to Case IV may be developed which ac- 
tually amounts to a second derivation of Case II. If 7,,? is zero, for- 
mula (36) fails to have meaning. However, if the reciprocal is taken, 
that is, 


2 
Tuc 


an ; 
then a formula may be derived to ascertain the weight required for 
a zero loading in the undesired factor in the composite. By a proce- 
dure similar to the one followed in the derivation of (38), it may be 
shown that 





w, = — Wn. (38’) 


| - don aie K’ranas) + (Tact um ayn Pe | = om 
Cs 


Tus? an K'T as? 
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Therefore, if 74: is equal to zero (and hence 7,2 and K’ are equal 
to zero), then the weight of the suppressor test is: 


; Tum?m - 
w, =— Wm; (39’) 
Tuss 





which is equivalent to formula (30). 


B. Predicting a Factor Score From Three Tests With Two 
Relatively Pure Tests Acting as Suppressors of 
the Variances in Two Factors 


In the suppression of more than one factor, the arithmetic labor 
required in the derivation of formulas corresponding to Case III and 
Case IV, in which more than one suppressor test is used, is prohibi- 
tive. Therefore, only Case I and Case II will be considered. In fact, 
the formulas derived for Case I are restricted to the instance in which 
each of the suppressor tests contains no loading in the desired factor. 
CASE I. To maximize the loading of the desired factor O in the com- 

posite score X,. 

If three tests are employed in a composite, the factor score may 

be expressed by the equation: 


Xe = WX a Wore + W3X3 , 


in which X, , X,, and X; are the respective scores in tests 1, 2, and 3 
and w,, w., and wy; are the corresponding weights applied to the re- 
spective test scores. Test 1 is loaded substantially in three factors O, 
N, and P. On the other hand, test 2 is saturated with factor N; and 
test 3, with factor P. In either test 2 or test 3, small loadings may be 
present in one of two factors other than O, N, or P. 

The correlation of the desired factor O with the weighted sum of 
scores in the three tests 1, 2, and 3 is expressed as: 


W031 91 TF WeG2To2 + W031 03 
» (40) 





Toc = 





W,?o,? + W702" + W703" + 2W,We01021 12 isp 2W4.W301031'13 

+ 2w2W 0203123 

where c = w,X, + woX, + WsX33 

112,113, and 72; are the intercorrelations of tests 1, 2, and 3; 
Tor. » Too , ANd 73 are the loadings of tests 1, 2, and 3, respective- 
ly, in factor O; 

W,, We, and wy, are the respective weights applied to scores on 
three tests; and 
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01, 62, and o; are the standard deviations of scores in tests 1, 
2, and 3, respectively. 


As in the instance of two tests in section A, an infinite number 
of sets of weights which stand in a given ratio to one another may be 
substituted in formula (40) without changing the value of To. 
Therefore if w, is held constant, w. and w; may assume but one set 
of values. 

A necessary condition for a maximum value of 7o- is that 








OW. Ws 
The partial derivatives of (21) are as follows: 


W2?02793 (Tos — To2% 23) + WeW 1010203 (21 31'12— 102713 — 1018 23) 
+ WeW30203" (To3%23 — To2) + Wi1W30103 (To3713 — Tor) 
OP oc + W,?0,703 (103 — To17 13) 














Ow. As _ 
OP oc 
aw, 

W?02037 (To2 — 123703) + W2W10102" (127 12 — or) 

- sieges (102123 — Tos) + WyW3019203 (272713 — 3712 — 01128) 

+ W,701702 (12 — Tor1 12) , (42) 


As/2 


where 
A = W,70,? = Wo" G5" + W703" Se 2W1W01021' 32 7 2W,W303031'13 
+ 2W2W 30203122 > 0 


3 


3 2 
A=3 wo? +23 WiWjoi0;Ti; > 0 (j> i). 
w=1 pe 
If each of these partial derivatives is set equal to zero, the re- 

sulting two simultaneous quadratic equations may be solved for we 
and w; in terms of w,, which may be assigned an arbitrary value. 
Unfortunately, the solution of these two equations is unwieldy, and 
the formulas obtained are of such complexity that a technician would 
find them extremely cumbersome to employ. However, if tests 2 and 
8 may be assumed to contain no variance in the desired factor 0; that 
is, if 7%. = 0 and 7; = 0, the solution of the two simultaneous equa- 
tions is considerably simplified. Under the assumption that 7. and 
1, vanish, the two simultaneous equations are as follows: 
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W 0303 [021 011 23W2 + o3%o1W3 + O17 017 181] = 0; (43) 
W,0102 [G21 We + o3ToiT2sWs + o:TaT2Wi] =0. (44) 
Provided that 
w,>0 
«,>0 (t=1,2,3) 
T.1 #0, 
and 
1 123 
>0, 
To, «1 








W, and ws; may be solved in terms of w,: 


(112 — 113123) 01 








W,=—— W,} 45 
, (1 — 723”) o2 : * 
(113 — 112723) o1 
WwW; =— Wy. (46) 
(1 — 123”) o3 


These formulas may be written in more familiar notation: 


01 


We = — Bi2.3— Wr, 
02 
ss (45') 
=— Die.sW,; 
O1 
Ws = — Bis.2— W1, 
G3 
a (46’) 
= — by2.3W; 
where 
B __ 112 — 113128 
ae re 
1 pees Tas" 
__ 113 — Ti2%23 
B13.2 ee eee 


1 —_ Tos* 


O1 
Bie.3 = Biz.s ae 
Co 


O1 
Bis.2 = Bis.2—.- 
C3 
In other words the suppressor weights, w. and w; , are negative 
b weights multiplied by w,. If w, is unity, an interesting parallel may 
be drawn to that of the multiple regression equation in which a cri- 
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terion is being predicted from two tests. Test 1 may be thought of as 
the “criterion” or dependent variable, and tests 2 and 3 as the “tests” 
or independent variables. In absolute value the b weights are the same 
for tests 2 and 3 in the suppression problem as they would be for the 
two “tests” in the multiple regression equation in which test 1 might 
be conveniently considered the “criterion.” Only the signs are re- 
versed. Inspection of formula (24) reveals that a similar interpreta- 
tion holds for.two tests. 
CASE II. To reduce the loadings of the two undesired factors N and 
P in the composite to zero. 

If rn and 7, (the respective correlations of the undesired fac- 
tors with the weighted sums of the scores in tests 1, 2, and 3) are to 
equal zero, it is necessary that 


W011 ny + WeeP ng + Wal ass 


VA 





? 


(47) 





Wy Oil py T WeGeh po + W331 p33 _ 

; VA 
Provided that A > 0, the numerators of the two equations alone 
need to equal zero. If the rank of the matrix consisting of the co- 


efficients corresponding to the unknowns w, and w; is of rank two— 
that is, if 


then the solution of equations for w. and w,; in terms of w, , to which 
an arbitrary value, not zero, may be assigned, is given by the for- 
mulas: 

(nit ps age TnVns) O71 


w,=— Wi, (49) 


(Tn21 ps oo TT ns) 02 





(Tn21n er 1) 04 
Ww; =— —— wy. (50) 


(Tn21ps — Tue ns) os 
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A TABLE FOR ESTIMATING THE SIGNIFICANCE OF THE 
DIFFERENCE BETWEEN ik aeons PERCENTAGES 


FRANCES SWINEFORD 
THE UNIVERSITY OF CHICAGO 


A table to aid in testing the statistical significance of the differ- 
ence between correlated percentages is presented. Percentage points 
which range from .10 to .001 are included. 


In a recent issue of this journal McNemar developed formulas 
for calculating y? — or (CR)? — to test the statistical significance of 
the differences of two correlated percentages.* The formula which 
he recommends rests on the null hypothesis that there is no differ- 
_ ence and is written 
(D— A)? 

D+A 


where D and A are the frequencies indicated in the following four- 
fold table: 


f= (CR)*= 


? 




















Item x 
No Yes 
Yes A B A+B 
Item y 
No C D C+D 
A+C B+D N 


The sum, A + B + C + D=N,, is the total number of individuals in 
the group. The percentages to be compared are those which corre- 
spond to the marginal totals A + BandB+ D. 

Certain assumptions and limitations should be noted. McNemar’s 
formula is based upon the assumption that the three possible “scores,” 
namely, a change from no to yes, no change, and a change from yes 


*McNemar, Quinn. Note on the sampling error of the difference between 
correlated proportions or percentages. Psychometrika, 1947, 12, 153-57. 
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to no, are appropriately represented by the values +1, 0, and —1 and 
that the standard error of the mean of such scores can be measured 
in the usual manner. The null hypothesis that the population values 
are equal has already been indicated. Furthermore, McNemar warns 
that inasmuch as (CR)? is identical with y? the formula should not 
be used when A + D is less than 10. 

In case the items to be compared are such that the desirable re- 
sponse is yes in one instance and no in the other, the A and D values 
in the formula become the frequencies in the yes—yes and the no—no 
cells of the fourfold table. 

Appended herewith is a table of the values of A for selected 
values of D for which the probability of chance occurrence is .10, 
.05, .02, .01, .005, and .001. Critical ratios corresponding to these 
probabilities were substituted in the formula, which was first writ- 
ten as a function of A. Only alternate values of the argument are 
used, because linear interpolation is adequate for finding the missing 
frequencies in the table. The entries are carried to one decimal place 
in order to facilitate such interpolation. 

Since the function is symmetrical in A and D, it was decided 
arbitrarily to call the larger frequency “D” and to table the smaller 
frequency. Any value of A which is less than the entry in the table 
indicates significance beyond tiie corresponding percentage point. 
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Values of A for Selected Values of D for Estimating the Significance 
of the Difference between Correlated Percentages 


A =number of individuals succeeding in item x only. 
D=number of individuals succeeding in item y only. 


























D>A. 
Probability Probability 

D D 
10 05 6©6.02——siwOs«iw05s«=Cw01 10 050 «6©6.02——i«Os«iw005s«£w00 
10. 29 29 20 18 O89 2... 80... 60.5 57.1 58.2 50.6 48.2 43.4 
12. 52 41 30 23 16 0.4 82... 62.2 58.7 54.8 52.2 49.8 449 
14. 65 54 41 338 26 1.2 84 64.0 60.4 56.4 53.8 51.3 46.4 
16. 80 6.7 538 44 36 2.0 86... 65.7 62.1 58.1 55.4 52.9 47.9 
18. 94 80 65 55 46 2.9 88 67.5 68.8 59.7 57.0 54.5 49.4 
20. 109 94 %74 67. 68 39 90 69.2 65.6 61.4 58.6 56.1 50.9 
22. 124 108 90 79 69 49 92 71.0 67.3 63.0 60.2 57.7 52.5 
24. 18.9 12.2 104 92 81 6.0 94 72.8 69.0 64.7 61.8 59.3 54.0 
26. 15.4 18.7 11.7 104 93 7.1 96... 74.5 70.7 66.4 68.5 60.8 55.5 
28. 17.0 15.1 18.1 11.8 106 8.2 98 76.3 72.4 68.0 65.1 62.4 57.0 
30. 18.5 166 14.5 18.1 118 9.4 || 100 78.1 74.1 69.7 66.7 64.0 58.6 
32. 20.1 18.1 15.9 144 13.1 10.5 ||} 102... 79.8 75.9 71.4 684 65.7 60.1 
34. 21.7 19.6 17.8 15.8 14.5 11.7 || 104... 81.6 77.6 73.0 70.0 67.3 61.6 
36. 23.8 21.2 188 17.2 15.8 13.0 || 106... 83.4 79.3 74.7 71.7 68.9 63.2 
38 . 24.9 22.7 20.2 18.6 17.2 14.2 || 108 85.1 81.1 76.4 73.3 70.5 648 
40. 26.6 24.3 21.7 20.0 18.5 15.5 || 110 86.9 82.8 78.1 75.0 72.1 66.3 
42. 28.2 25.9 28.2 21.5 19.9 16.8 || 112 88.7 84.5 79.8 76.6 73.7 67.9 
. 44. 29.9 27.4 24.7 22.9 21.3 18.1 || 114 90.5 86.3 81.5 78.3 75.4 69.4 
/ 46. 31.5 29.0 262 244 22.7 19.4 || 116... 92.38 88.0 83.2 79.9 77.0 71.0 
48 . 33.2 30.6 27.8 25.9 24.2 20.7 || 118... 940 89.7 84.9 816 78.6 172.6 
50. 34.8 82.2 29.38 27.3 25.6 22.1 ||} 120... 95.8 91.5 86.6 83.3 80.38 74.2 
52. 36.5 33.8 30.8 28.8 27.0 23.4 || 122... 97.6 93.2 883 949 81.9 15.7 
54. 38.2 384.5 32.4 380.38 28.5 248 || 124... 99.4 95.0 90.0 866 83.6 177.3 
56. 39.9 37.1 33.9 31.9 30.0 26.2 || 126...101.2 96.7 91.7 8838 85.2 789 
58 . 41.6 38.7 35.5 38.4 81.5 27.6 || 128...108.0 98.5 93.4 90.0 86.9 80.5 
60 . 43.3 40.4 387.1 349 32.9 29.0 || 180...104.8 100.8 95.1 91.7 88.5 82.1 
62. 45.0 42.0 38.7 364 3844 30.4 || 182... 106.6 102.0 968 93.3 90.2 83.7 
64. 46.7 43.7 40.2 38.0 35.9 31.8 || 184... 108.4 103.8 98.5 95.0 918 85.3 
66. 48.4 45.3 41.8 39.5 37.4 33.2 || 186... 110.2 105.5 100.2 96.7 938.5 86.9 
68 . 50.1 47.0 43.4 41.1 389.0 34.7 || 188... 112.0 107.3 102.0 98.4 95.1 88.5 
70. 51.8 48.7 45.0 42.7 40.5 36.1 || 140... 113.8 109.1 103.7 100.1 96.8 90.1 
72. 58.6 50.3 46.7 44.2 42.0 87.6 || 142... 115.6 110.8 105.4 101.8 98.5 91.7 
74. 55.8 52.0 48.3 45.8 43.6 39.0 || 144... 117.4 112.6 107.1 103.5 100.1 98.3 
76. 57.0 53.7 49.9 47.4 45.1 40.5 || 146... 119.2 114.4 108.9 105.2 101.8 94.9 
78 . 58.8 55.4 51.5 49.0 46.7 42.0 || 148... 121.0 116.1 110.6 106.9 103.5 96.5 
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From the factorial analysis of a battery of nineteen individual 
tests seven factors were obtained, six of them being interpreted. 
Some of these factors are probably related to similar ones isolated 
in different batteries and by different authors. The test Progressive 
Matrices here analyzed is loaded in some of the factors, as was ex- 
pected according to the nature of its problems and to previous ex- 
perience with the test. The loadings of the tests and of the factors in 
the second-order factors are given. It is expected that this exten- 
sion of factor theory will prove useful for the isolation of more fun- 
damental psychological parameters, although at present it is difficult 
to evaluate precisely the theoretical and practical implications of 
this aspect of factor analysis. 


Introduction 

The present paper deals with the factorial study of a battery of 
individual tests, among which were included several of the perform- 
ance type, Raven’s Progressive Matrices and some others. None 
of the tests, as far as inspection can determine, has any verbal com- 
ponent, and it is expected that such a battery will give information 
about the space and perceptual factors. Raven’s Progressive Mat- 
rices, insofar as previous experience with the test seems to indi- 
cate, appears saturated in these same factors, and consequently its 
inclusion in the present battery may give clues about its factorial 
complexity. 

Since the basic works of Thurstone (21, 18), the space factor 
has been found in different batteries of the usual paper-and-pencil 
type. The suggestion has been advanced that there are several kinds 
of space factors or space-perception factors (5). The Minnesota 
form boards have been reported as saturated only in space (4). More- 
over, the space factor has become of considerable interest, not only 
from the theoretical point of view but also as a component of the 
so-called mechanical ability, in the sense that it has been found re- 
peatedly in studies of this kind (4, 5, 24, 23). 

Perceptual factors have been discovered in most of the studies 

* This study was carried on under a State Department Grant and a Frank’s 
Fund Fellowship at the University of Chicago. The author wishes to express his 
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mentioned above, and their nature seems to be rather complex and 
their relation with the space factor somewhat complicated. The de- 
velopment of tests of spatial and perceptual abilities has been strong- 
ly recommended for the purposes of occupational analysis (17). 

Performance tests have not been employed for factorial studies 
as frequently as the usual verbal paper-and-pencil type. F. Gaw (2) 
says that “Performance tests as above mentioned should be consid- 
ered as essentially measurements of intelligence,” where the manual 
response “is but a means to an end.” Since the materials commonly 
employed for testing intelligence are usually strongly dependent on 
scholastic achievement, information, the use of words, and so on, it 
would be interesting to know whether the same fundamental activ- 
ities which play a part in the performance of paper-and-pencil tests 
are found in a battery of this kind, where these influences are appar- 
ently absent. 

Moreover, group testing techniques have led psychologists to 
stress the importance of the end results of the activity called forth 
in testing. Few of the common paper-and-pencil tests give much in- 
formation about the way in which psychological activity develops 
during the actual performance of the problems. Therefore the con- 
clusions are based more in terms of the final results than in terms 
of the evolution of the process. 

It is known that the same end results can be obtained by differ- 
ent means. Each individual has a personal way of apprehending the 
problems presented to him and solves them accordingly. How are 
these different ways of approach effective for the solution of the 
problem? What is the rigidity or plasticity that a person possesses 
enabling him to use different patterns of action? To what extent can 
the knowledge of these facts be employed for the improvement of the 

/ individual? These and similar questions are of theoretical and prac- 
‘ tical importance, and it is expected that careful observation of each 
subject during the performance of the tests will contribute to their 
. understanding. 

Raven’s Progressive Matrices have been considered by Raven and 
Spearman as loaded in “g” (16). The analysis of the items and the 
reports of the subjects seem to indicate that the test was devised ac- 
cording to noegenetic principles. But the question is: Can its vari- 
ance be attributable to “g” only? Previous experience with over 
2000 cases (12, 13) seemingly reveals that different mental functions 
are active in the solution of the items. Moreover, the results obtained 
by means of the Raven tests are indicative of the fact that these have 
good discriminative power, can be used in a wide age range, and ful- 
fill most of the requirements needed for testing normal and handi- 
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capped subjects (3, 11, 12, 13). Therefore the knowledge of its fac- 
torial composition is of theoretical and practical value. 

Furthermore, the study of the second-order domain (12, 20) may 
contribute to the understanding of many of the problems here under- 
lined, since it appears to refer to more fundamental components. 


Description of Tests, Population, and Method 

The Seguin-Goddard form board (No. 1) ;* the Healy Construc- 
tion Test A (No. 2); the Diagonal test (No. 12); and the Triangle 
test (No. 13) are all scored in time, following the indications given 
by F. Gaw (2). In test 1 the score is the time taken in the shortest 
of three trials. Tests 2, 12, and 13 consist of rectangular wooden 
frames and a certain number of wooden blocks to fit in them. Test 
2 has six rectangular blocks; Test 12 has five wooden blocks, three 
triangular, one rectangular, and one trapezoidal; and test 13 has a 
square and a rectangular frame, both cut in the same wooden board 
where four identical triangles have to be fitted exactly. Each test has 
a time limit and the score is the number of seconds that the subjects 
employ for the solution of the test. 

In the Cube Imitation test, No. 11 (2), the experimenter places 
in a row in front of the subject four black cubes, all of the same size; 
then with a fifth cube he taps them according to a special pattern of 
varying difficulty, which the subject must then repeat. There are 
twelve different patterns, and the score is the number correctly re- 
produced. If the subject fails in three successive trials, the test is 
terminated. 

The Porteus Maze test (No. 8) has been scored and adminis- 
tered according to the instructions given by F. Gaw (2). 

The Object Assembly test was scored as three separate tests: the 
Mannequin (No. 3), the Face (No. 4), and the Hand (No. 5). The 
number of pieces placed correctly within a certain time limit gives 
the score. For more details, the reader is referred to D. Wechsler 
(22). 

Cancellation of letters, test 9, is similar to the current tests of 
the same name. 

In Cancellation of Figures, test 10, the subject has to cancel all 
the squares which have an extra line placed inside and in the same 
position as the one in the square drawn at the top of the page. ~ 

Digits forward, test 6, and Digits backward, test 7, are similar 
to the current tests of the same name. 

Drawing of a man, test 19, was scored according to the instruc- 


* The numbers following the names of the tests refer to the code number 
employed in the present article. 
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tions given by C. Burt (1). Three different judges scored each sub- 
ject. Tests 8 and 9 are both scored in terms of mental age, showing 
a very close agreement in the results. The means and standard devia- 
tions for test 19 and test 8 are, respectively: means 9.406 and 9.554, 
with standard deviations of 1.564 and 1.681. These values are not 
statistically different. 

Raven’s Progressive Matrices—Sets A, B, C, D, and E—were 
scored as separate tests: Nos, 14, 15, 16, 17, and 18, respectively. The 
whole series consists of 60 patterns in which a part is missing, and 
at the bottom of each pattern there are six or eight different parts 
that fit in the missing part, only one of which the subject has to se- 
lect to complete it. There is no time limit and the score is the num- 
ber of items answered correctly. In the present population the mean 
and standard deviation are 27.48 and 9.29. 

In the present study all the tests were given individually in dif- 
ferent sessions, none of them lasting for more than an hour. The 
whole period of testing was completed in no more than three ses- 
sions, each session in different days. 

The age of the 138 subjects of both sexes ranges between eight 
and fifteen years, with a mean age of 11 years 4 months, and a stand- 
ard deviation of 1.860. The subjects were randomly selected from 
different schools. 

The correlations between the variables are given in Table 1. The 
centroid matrix in Table 2 was obtained after three successive fac- 
torizations, that is, to the point where the communalities were stable. 
The residuals were negligible after seven factors were removed. Their 
root mean square deviation has a value of .023. 

The centroid matrix was rotated by different methods according 
to the development of the problem. In all, 23 radial rotations were 
made, plus several trials with the single plane method and other de- 
vices. The final matrix of transformation is given in Table 3. Table 
4 is the rotated factorial matrix, and Table 5 gives the cosines of the 
angles between the reference vectors. 

As was expected from the inspection of the correlations (see be- 
low), the tests cluster along the reference axes after a few rotations, 
although a satisfactory positive manifold was secured only after sev- 
eral rotations. 

The factorization of the correlations between the primaries, 
Table 6, led to the expression of the primary factors in terms of three 
centroids. This factorization was repeated several times in order to 
fix the communalities. The results are shown in Table 7. From here 
it is easy to express the loadings of the tests in the centroids of the 
second-order domain. These values are given in Table 8. 
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Tables 9 and 10 express the matrix of transformation (obtained 
by plotting directly on the sphere the normalized values of the fac- 
tors and the tests in the second-order centroids) and the cosines be- 
tween second-order reference vectors. Tables 11 and 12 give the satu- 
rations of the primary factors and of the tests in the new reference 
vectors of the second-order domain. 


Discussion and Results 

The inspection of the table of correlations shows several inter- 
esting facts. There is quite a wide range in the magnitudes of the 
coefficients and a number of them are negative, mainly, those corre- 
sponding to variables 1, 12, 13, and 2, although these four variables 
are positively correlated among themselves.* 

All these four tests were scored in time, and consequently better 
performance means less time. Reversing their signs, only fifteen neg- 
ative correlations remain for the whole table of 171 coefficients. Only 
one of these negative correlations may be considered as significant at a 
5 per cent level of confidence. The other ones have no statistical signifi- 
cance. In batteries of this kind, negative correlations have been report- 
ed by different authors. From this analysis one may conclude that 
probably one of the parameters will refer to speed and will involve the 
four tests mentioned above. 

The five sets of the Raven test are highly correlated among them- 
selves, and in general, have higher correlations than the other tests 
with the rest of the variables. In consequence, it is expected that 
their variance will be split into several different factors, one of them 
common to all and the others common with several other tests of 
the battery. As stated in the introduction, this was strongly suspect- 
ed from a simple analysis of the tests and a rather extensive experi- 
ence with it. 

The three variables of the Object Assembly test are not highly 
correlated among themselves. Only test 4 shows somewhat larger cor- 
relations with the rest of the variables, mainly with the sets of the 
Raven test. 

Tests 9, 10, and 7 have very low correlation with the rest of the 
variables; and the fact that tests 9 and 10 have a very low correla- 
tion between themselves is contrary to expectations, a point that may 
be further clarified in terms of the factorial composition of the tests. 

Comparison of the loadings of the tests in “g,” using Spearman 
formula #21 (16), and their saturation in the first centroid shows a 
very close agreement. Nevertheless the residuals after this factor 
has been taken out are still quite large. 


*It will be noted in Table 1 that the signs of variables 1, 12, 18, and 2 have 
been changed. 
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FACTOR B 
a: RR A ose oh i hs -50 
4, Face ...... .45 
OR 0 ER Reece een eerie 41 
13. Triangle .... 34 
8. Porteus ae 
15. Raven B .. 31 
19. Drawing of a Man ................ .30 
SS) beer ee eS See .28 
Se cee ener eae 23 
RG: “WRN RS, oo ee 23 


(Here as in the other factors we shall list all loadings above .20) 

The fact that Raven A and tests 2, 4, and 13 have high loadings 
in this factor seems to indicate that it deals with the perception of 
spatial relations. The impossibility of perceiving the relations be- 
tween the different blocks is a cause of frequent failure in tests 2, 
12, and 13. The subjects usually report that they do not know how 
the different blocks go together, although they perceive their shapes 
correctly. The subjects usually inspect the blocks carefully, handling 
them in different ways, following their contours with their fingers and 
trying to work partial solutions outside the corresponding frames. In 
spite of this exploration, they fail to bring them together in their 
correct relations. 

Many subjects change from a systematically planned way of at- 
tacking the problem to a random activity; others seem to go through 
the problems by means of a methodical exploration of the parts; and 
still others, who begin to explore the parts in a random way, reach 
a certain moment in which the problem clarifies itself (insight?), 
and from there on go smoothly to the solution. The experience of see- 
ing into the problem is accompanied by an evident relaxation of the 
subject. Some individuals who verbalize their actions seem to indi- 
cate by their expression that they are seeking essentially for the re- 
lationships between the parts in order to obtain the required results. 

In test 4, the perception that the blocks should make a profile 
seems to L.elp the subjects, but the position of some of the pieces does 
not become in any way obvious, for instance those of the ear. In test 
8, it is also clear that the subjects have to perceive the relations be- 
tween the different walls of the maze to find their way through. 

In the drawing of a man, success depends not only upon the 
number of the parts drawn correctly but also upon the relations of 
these parts. The older children draw the parts better and relate them 
more logically. The whole problem, at least in so-called normal chil- 
dren, is pervaded by the influence of better articulation of the parts 
toward obtaining a better “gestalt.” 
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The different sets of the Progressive Matrices, with the excep- 
tion of E, have saturations in this factor. Set A requires a percep- 
tion of the spatial relations of the different parts. The subject sees 
a whole which has some part missing. The analysis of the most fre- 
quent mistakes seems to indicate that the “figure-ground” relation- 
ship in many cases may be characterized as “inverted.” 

In Set B, most of the figures-imply the play of strong gestalt 
forces. The wrong answers suggest that the subjects do not see the 
interrelationship of the parts in terms of the total, but merely give 
as a solution one of the figures adjacent to the missing part in the 
main. pattern. In Sets C and D, there are other forces involved and 
the ones mentioned above are less obvious, although present. 

Summarizing, we could say that factor B seems to imply the per- 
ception of relations in space necessary for the construction of a whole. 
Since the tests loaded in factor B are different, this factor probably 
portrays some fundamental central component of psychological ac- 
tivity. 


FACTOR C 
11. Cube Imitation -......0000000..... 45 
10. Cancellation of Figures ...... 44 
7. Digits Backwards ................ 41 
BOs BVOR TB) nc... s cece cccnnk .20 


Test 11 requires the subjects to reproduce a certain temporal and 
spatial pattern. The subject has in front of him a series of. cubes 
ordered in a row. He has to keep in mind the pattern (factor E) and 
reproduce it against the strong configuration given by the linear ar- 
rangement of the cubes. It is interesting to observe that some per- 
sons do not even attempt to solve the problem in the way they are 
asked but tap the cubes in succession. That this is not a simple mis- 
understanding of the order is evidenced by the fact that some of them, 
after having completed some patterns satisfactorily, fail in the more 
complex ones—either by reproducing some previous pattern (some of 
them are strongly preferred by the subjects) or, strangely enough, 
by tapping the cubes in succession. It seems that the subjects have 
to resist strong forces which try to destroy the requested “gestalt.” 

In Cancellation of Figures the subject has a given percept which 
must be individualized among other somewhat similar percepts. Can- 
cellation of letters represents a different problem, since the materials 
employed are heavily loaded with other components, although the 
task is based on an apparently similar performance. 

In Digits Backwards the subject has to repeat the numbers in 
inverse order, therefore making a configuration while keeping in mind 
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the one previously given. The task is carried on with strong field and 
figures forces working against its completion. 

Apart from Raven B, where a similar sort of activity is required, 
the remaining tests, other than those listed, are not loaded jn factor C. 

It is obvious that other tests such as 1, 19, and 6 do not require 
the sort of activity implied by factor C. In tests 2, 12, and 13, al- 
though there is a conflicting force it works in a different way, since 
the situation deals with the spatial relations needed to build a cer- 
tain pattern without conflicting with any outside configuration. Fac- 
tor C, on the other hand, refers to the conflict between two or more 
configurations, the elements inside them being given. The more plas- 
tic the subject, the more he is likely to be “gestalt free’ and the bet- 
ter will be the solution. 

This factor is probably in close connection with Thurstone’s E 
(19) and Meili’s “plasticité” (6), while factor B seems to be related 
to Thurstone’s A (19). New experiments might be devised to further 
clarify this point, probably throwing light upon the dynamics of per- 
ception and the nature of certain “gestalt” forces. Since all the tests 
in Factor C are rather dissimilar, this factor probably transcends the 
content, of immediate perception or performance. 


FACTOR D 
ieee 51 
11. Cube. Imitation ....................... 46 
6 Digits Forwards .................. 34 
Fes, ERE ac eee er ere .28 
S  POPRSUS BRBRC niscsc-csscvcicsccccss 24 
Bey eS a a1 


The interpretation of this factor will require further confirma- 
tion, since it seems to refer to a rather fundamental variable which 
has not been isolated factorially up to the present. 

The problem presented by test 3 was usually solved rather easily. 
The most frequent mistakes are concerned with the wrong location 
of the limbs, in the sense that the right and left sides are frequently 
interchanged. The subject explores both lower and upper limbs, han- 
dling them for a certain while until they perceive the side to which 
they belong. Some others believe that they have solved the problem 
satisfactorily, but after a while change the position of the limbs to 
their correct sides. Those who do not solve the problem usually have 
no difficulty with the head piece, but locate the legs or arms in the 
non-correspondent side of the body, this being by far the most fre- 
quent mistake. Therefore it seems that for the solution of this test 
the distinction of left and right side is necessary. 
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In Cube Imitation the subject has to remember a pattern and 
fight against distracting forces, but he also has to recognize that the 
different cubes are placed in certain relations of left and right among 
themselves and with reference to the subject’s own body. 

In test 6 the same activity may be needed inasmuch as most of 
the subjects represent the numbers spatially. In test 5 (the Hand) 
the most frequent mistakes consist of transposing the order of the 
fingers, especially those which require a finer discrimination of left 
and right—2nd, 3rd, and 4th fingers. 

The same sort of relationship seems to be clear in test 8 where 
the subject is instructed like this: “Suppose this is the plan of the 
paths in a garden. These lines are the walls which you can not get 
over....” It is evident that in such a situation the subject tends to 
think in terms of right and left turns. 

In Raven E a study of the most frequent mistakes indicates that 
many subjects work the problems going in a left-right direction. It 
is interesting in this respect that Raven and Miller (7) have reached 
a similar conclusion. 

It is probable that this factor is connected with other space fac- 
tors, but its significance seems to be different in the sense that the dis- 
tinction of right and left refers to a fundamental bodily dimension. 
The factor is probably connected with the body scheme (Schilder). 
Nielsen (8) says: “It is on the basis of handedness and language 
that the major and minor sides are differentiated and the disturb- 
ances of the body scheme are readily separable into those of the ma- 
jor and those of the minor sides.” There are cases of amnesia for the 
limbs of only one side, and Gerstman’s syndrome includes, among other 
symptoms, confusion of right and left side, not only in the patient’s 
own body but also in that of others. Sometimes the agnosia extends 
to ali animate and inanimate objects. 

The existence and significance of this factor needs better testing 
by means of carefully planned experiments in connection with the 
integration and disintegration of the body scheme. Probably it would 
be more easily isolated in younger ages among normals. 


FACTOR E 
6. Digits Forwards .................. 55 
7. Digits Backwards ................ 35 
pC TER | 2 Ce ee 382 
11. Cube Imitation ~.................. 28 


This seems to be a memory facior. Digits Backwards and Digits 
Forwards have evidently a memory component. Cube Imitation re- 
quires memory inasmuch as the retention of the pattern given is nec- 
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essary for the fulfillment of the task. It is difficult to explain why 
Set E is loaded in this factor. Nevertheless, this set is much more 
complex than the others, and it is not unlikely that immediate reten- 
tion would play an important part. Other tests of this battery do not 
seem to require memory, in the sense here understood. 








FACTOR F 
1 Seguin-Goddard .................... 51 
12 Diagonal 40 
19 Drawing of a Man ................ 34 
OS Te a 32 
10 Cancellation of Figures ...... 31 
RB PRR URR SS: site et en .29 
Lf 7 ERS reeenens ptanee cone a 28 
5 Hand 26 
2 Healy Construction A .......... 23 


Tests 1, 2, 12, and 13, are scored as a function of time. Better 
solution means shorter time. In Cancellation of Figures the subjects 
have to perceive a certain given figure while working at the highest 
possible speed. In Sets E and D of the Progressive Matrices, the 
speed with which certain details are perceived seems to be important 
for the solution. In many cases the subjects fail to perceive certain 
characteristics of the items and the solution is wrong. The problems 
in these sets seem to be different from the problems in the other sets, 
although some items of Set C seem to require the same sort of ac- 
tivity. It is our experience that many subjects can not solve these 
problems, although they devote a long time to them. A quick per- 
ception of details is apparently missing, as can be corroborated by 
going over the test following its completion. Once the perceptual de- 
tail has been pointed out to the subject, the solution comes easily. 

It is interesting to observe that tests 2, 12, and 13 appear in fac- 
tor B as well, but with this characteristic: the more they are satu- 
rated in B the less saturated in F and, vice versa. In test 2 the per- 
ception of relations has a large share in the variance of the test. In 
test 12 it is usually observed that the more rapid the perception of 
the trapezoidal piece the easier the solution. In test 13 the problem 
is facilitated once the identity of the pieces is apprehended. The 
average times for the solution of these three tests are the following: 
No. 2, 92”; No. 13, 83”; No. 12, 33”—thus adding further weight to 
the interpretation that this factor probably deals with speed of per- 
ception. 

Thurstone (19) has described a factor similar to the one here 
mentioned. He says that in this factor “the subject has no real per- 
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ceptual problem,” adding later on “but we have no evidence that the 
factor is quite so simple.” This factor seems to agree with what we 
have expected from the analysis of our correlation table. 


FACTOR G 
1 Seguin-Goddard .................... 85 
8 Porteus Mazes ...................... ol 
9 Cancellation of Letters ........ .30 


It is not advisable to attempt an interpretation of this factor, 
since it appears in only three tests and has small loadings in all of 
them. 














FACTOR A 
17 Raven D .68 
16 Raven C 55 
15 Raven B 54 
18 Raven E 45 
14 Raven A 34 





All the other variables apart from the five sets of the Raven test 
have practically zero loadings in this factor, which therefore appears 
as common to all the sets of this test and corroborates the analysis of 
the correlations. 

This series of matrices has been considered by its author and by 
Spearman as highly saturated in “g.” 

The careful analysis of the 60 items will convince the reader that 
the noegenetic principles have been taken into consideration in its 
construction. According to Spearman (15), “g” is a combination of 
both noegenesis and abstraction; and, referring to the test under con- 
sideration, he says that it can be solved in two ways, viz, analytically 
and synthetically, and that the former procedure, not the latter, tends 
to load noegenetic processes with “g.” 

The contribution of other factors to the variance of the Raven 
test has already being studied. Some of them may refer to what Spear- 
man calls synthetic activity. 

One possible way of interpreting factor A would be to regard it 
as similar to Thurstone’s J, since the subjects’ reports indicate that 
they try to find a “rule or principle for each item of the test.” In this 
sense the test will probably require the analytical activity mentioned 
by Spearman. Owing to the design of the present battery, these points 
can not be tested in this study and moreover would lead to the analy- 
sis of the complex relations between “g,” J, R, and D, which have 
been dealt with elsewhere (14). Accordingly, it is preferable at this 
time to refrain from giving any interpretation of Factor A. 
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Study of the Second Order 

Some of the correlations between the primaries are negative—a 
fact which may depend upon the selection of the subjects (20). Never- 
theless, it is difficult to explain how factors can be negatively corre- 
lated in the field of intelligence. By studying Table 6 one realizes that 
most of the negative values correspond to factor F, which according 
to the previous interpretation seems to be connected with speed of 
perception. Since this is the only factor apparently related to speed, 
it would not be too difficult to explain the values of the correlations 
between this primary and the rest of them. 

There are few studies with reports about the second order. In 
a study not yet published (14), the writer employed the technique of 
expressing the original variables in terms of the second-order factors. 
The experience seems to indicate some interesting findings, although 
one must be cautious about giving premature judgment in the matter. 

The basic logic underlying this extension of factor theory resides 
in the fact that if the primaries are correlated, their variance should, 
theoretically at least, be explainable in terms of common variance, plus 
specific and error variance. The factors thus obtained probably refer 
to more fundamental psychological variables, since they are the ex- 
pression of the correlations between the primaries. Expressing the 
tests in these new factors, one may get an idea of how their perform- 
ance clusters around these second-order factors. This technique raises 
many questions not yet thoroughly answered in factorial theory. 

Since part of the variance of the primaries (uniqueness) has 
been lost in the process of factorization, it is not to be expected that 
the original correlations will be exactly reproduced by means of the 
loadings in the second-order centroids. In the process of rotation it 
is advisable to look for the best fit for both factors and tests, without 
expecting a perfect agreement, for the reasons mentioned above. 


FACTOR y 
NE UR heer A es ee 30 
Le ans nectar -76 
i ge, | SERCO nee mere Roemer ect .62 
PTO NG ieee ee tata eo 


Factor B was interpreted as perceiving relations in space neces- 
sary for the construction of a whole, which suggests that this factor 
is of a rather complex nature. 

As can be seen by studying Table 12, all the tests saturated in B 
have loadings in the second order factors y and a. 

Although Factor A was not interpreted, it seems to be related 
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to some sort of analytical activity, and all tests loaded in it have satu- 
ration in y. 

Factor D may well require a discriminative analysis, judging 
from the subjects’ reports. Factor G was not interpreted. 

Very likely factor » represents a sort of analytical activity. Ac- 
cording to the figures in Table 12, it is rather general in scope and, 
by the part it plays in both factors and tests, seems to be present in 
different kinds of psychological performances. Probably it is related 
to what has been called abstraction and indicates a rather funda- 
mental dimension of psychological dynamics, such as capacity to ab- 
stract from complex presentations. 

If this interpretation is correct, it is understandable why factors 
F, E, and C are not loaded in y. 








FACTOR «a 
Factor B 57 
Factor F -26 
Factor G —.22 





It is not possible to interpret this factor with certainty. Accord- 
ing to the previous description, Factor B seems to be related to Thur- 
stone’s A (19) and probably to the factors described by Mieli (6) as 
“complexité” and “globalization,” which the author defines as: “ca- 
pacité de réaliser clairement et avec précision des structures com- 
plexes” and “. . . la facilité avec laquelle des données relativement 
separées s’unissent pour former un tout.” These two factors, accord- 
ing to Mieli, may go together but may also be opposed to his factor 
“oplasticité,” which is similar to our Factor C. 

F has a small loading in a, and probably for the perfect opera- 
tion of this factor a certain amount of syntheses of the given per- 
cept may be needed. 

Therefore, and only as a tentative explanation, this factor could 
be considered as expressing a certain synthetic (syncretic) activity. 


FACTOR B 
Factor D —.73 
Factor E 42 
Factor G 48 











This is a bipolar factor, one of whose extremes is represented by 
the memory factor. At the present moment it is not possible to inter- 
pret it. 
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Summary 

From 19 variables we have extracted seven factors and inter- 
preted only five of them. 

Factor D, which seems to be concerned with the relation of right 
and left, needs further investigation. 

Factor A can not be easily identified in terms of the present bat- 
tery. 

Factor B seems to imply the perception of relations in space nec- 
essary for the construction of a whole and is probably related to 
Thurstone’s A and to Mieli’s “complexité” and “globalization.” It is 
a rather complex component and its significance seems to transcend 
the actual test context. 

Factor C is probably related to the difficulties encountered in the 
construction of a gestalt, when there are disturbing forces. It is 
clearly differentiated from factor B and seems to be related to Thur- 
stone’s E and to Mieli’s “plasticité.” 

Factor F has been interpreted as speed of perception and is 
probably similar to Thurstone’s factor of the same name. Since the 
last three mentioned factors, or their counterparts, have appeared in 
different studies employing different tests and populations, it would 
seem that they represent certain basic psychological concepts. 

Factor E has been considered as an inmediate memory factor. 
Factor G has not been interpreted. 

Of the second-order domain factors, none has been definitely 
identified. It would seem that factor a represents some sort of syn- 
thetic activity, while factor y represents the capacity to abstract from 
complex presentations. Probably these two processes play a part in 
all intellectual activities, although their relative weight may vary ac- 
cording to the circumstances. 

Factor f£ has not been interpreted. 

It is evident that the factorial composition of this battery reveals 
the existence of space and perceptual factors plus one of speed. The 
interrelations between space and perceptual factors is complex and 
probably the stress should be laid more on the understanding of their 
interplay. 

Some of the names usually given to factors are related more to 
the special make-up of the tests than to the psychological activity 
going on in the subjects. Very probably factors will have to be classi- 
fied in terms of their psychological nature. There is a suspicion that 
some of them, such as the verbal and numerical ones, would repre- 
sent the ability to deal with symbols. This ability would possibly be 
different from the one represented by the other factors and would be 
quite genera] in scope. 
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Test 
No. I 
1 370 
2 404 
38 268 
4 559 
5 390 
6 391 
7 $320 
8 578 
9 161 
10 8 135 
11 8410 
12 270 
13 411 
14 653 
15 650 
16 §=©671 
19 Tel 
18 512 
19 461 
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TABLE 2 


The Centroid Factorial Matrix* 


-157 
334 
205 

—220 

—332 

-317 

-313 

-157 
214 


III 


204 
128 
295 
065 
231 
102 
—411 
057 
—209 
—229 
—237 
194 
058 
—-192 
-211 
-166 
148 
199 
093 


IV 


-122 
067 
-349 
115 
—064 
170 
110 
—-196 
-110 
-173 
—424 
300 
—038 
189 
074 
216 
151 
166 
—088 


v 


—024 
114 
—032 
146 
—105 
-351 
-279 
202 
077 
-142 
—263 
-071 
165 
170 
192 
094 
179 
-197 
117 


VI VII 
-187 -150 
161 098 
118 160 
288 -093 
091 -053 
313 -155 
0382 -081 
179 -146 
-073 -249 
-201 217 
159 = 058 
-054 110 
—-028 170 
1385 169 
-160 132 
-153 -114 
-322 034 
-243 -129 
—026 038 


* The decimal point has been omitted for all the entries. 


I 

II 
III 
IV 
Vv 
VI 
VII 


The Final Transformation Matrix 


A 


32 
-.56 
-.02 

35 

00 
-.66. 

14 


TABLE 3 
B Cc D 

41 .20 8 .25 
19 12 —.26 
01 -.64 48 
28 -.25 -.56 
65 -45 -.56 
44 -.09 .26 
32 


E F G 
23 86.8618 
-01 60 31 
05 387 # .00 
27 -08 -.39 
-.88 -.16 .32 
24 -.55 -.09 


50 02 -18 19 -.77 

















H. J. A. RIMOLDI 43 


TABLE 4 
The Rotated Factorial Matrix* 


Test 
A B Cc D E F G 


+1 04 -03 11 06 51 = 35 
-08 41 -01 08 04 28 OF 
06 05 OO 51 O1 O08 -05 
: -01 05 11 -04 14 
-02: 12 038 28 20 26 12 
00 05 O1 384 55 -02 -04 
01 00 41 -01 35 08 04 


CWMNATR WON Z 
~ 
o 


19 04 30 08 OF -038 34 19 


* The decimal point has been omitted for all entries. 


TABLE 5 
Cosines of Angles Between Reference Vectors 


A B Cc D E F G 


99 
-.12 1.00 
05 -14 .99 


-15 -34 .12 1.01 

-01 -.36 .238 48 .99 

13 -04 .14 09 .05 1.00 
-.30 -.06 -.35 -.04 -.23 13 .99 


QA DAWP 


TABLE 6 
Correlations Between the Primary Vectors 


A B Cc D E F G 


1.00 
24 1.00 
11 =.14 «=1.00 


21 .24 .02 1.00 

07 .27 -.06 -.37 1.00 

-.24 -.09 -.20 -.10 -.06 1.00 

38 .27 87 04 .24 -.27 1.00 


QAnVTAwP 
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TABLE 7 
Loadings of the Primaries in the 
Centroids of the Second Order 
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Rotated Factorial Matrix for the Pri- 
maries in the Second Order 


Rotated Factorial Matrix for the 
Tests in the Second Order 


Y a diy 
A 49 -12 -.10 a 
B 86 2 s2 B 
C 29 02 -—.14 4 
D 281 -—70 .86 
E 14 44 -.04 
F -38 .08 .25 
G 15 22 —387 
TABLE 8 
Loadings of the Tests in the Centroids nN 
of the Second Order B 
, , , fs 
I 1 6af D 
16 620 6 OO E 
2 26 26 @ F 
8 21 -43 .27 G 
£ 2B a7 “21 
5 .2 -.03 .19 
6 2 02 .16 
7 20 22 -.09 
8 61 -04 .17 
> 2 ji1 -<17 
10 -.04 .02 -.01 
11 6.85 -.26 .05 4: 
12 -08 .22 .23 2 
3 2s 238 25 3 
144 6S8lC«Cdd eS 4 
15 638 .00. .11 5 
16 .55 10 ~ = .03 6 
17 51 -.04 18 7 
18 .24 -.05 .06 8 
s9 s31 28 2 9 
10 
11 
TABLE 9 12 
Final Matrix of Transformation for 13 
the Second Order 14 
I’ 00 15 .70 
I’ 48 .85 23 ae 
Ill’ 88 -50 .68 8 


TABLE 10 
Cosines of Angles Between Preference 
Vectors in the Second Order 


a. B 
1.00 
-03 .99 
49 -.42 
TABLE 11 


a B 
-15 .02 
57 04 
-11 .13 
-.02 -.73 
18 .42 
.26 -.11 
-—22 «.48 
TABLE 12 


a B 
08 .08 
33 8§©6.08 
03 -.47 
mt | |6Ae 
15 -.09 
15 -.03 
03 .26 
13 -.03 

-.10 .22 
00 .02 
-.08 -.19 
31 36.07 
26 -.01 
27 ~~ ~=.05 
10 02 
07 .15 
14 -.05 
.03 -.04 
21 Ol 





Wf 


1.00 


Y 


Y 


15 
34 
43 
AT 
30 
.26 
.03 
55 

.03 
-.04 


34 


— 











—— 








10. 


11. 


12. 





H. J. A. RIMOLDI 45 


TABLE 13 
Loadings of the Tests as Determined 
by Using Spearman’s Formula 


1.351 
2 402 
3.224 
4  .559 
5.886 
6 871 
7 = .285 
8 .576 
9 .144 
10 = .097 
11.882 
12 = 241 
138 —-.408 
14 648 
15.633 
16.656 
17 ~—-.736 
18 .495 
19 .460 
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BOOK REVIEWS 


HENRY E. GARRETT. Statistics in Psychology and Education. New York: Long- 
mans, Green and Co., 1947 (Third edition). Pp. xii'+ 487. 


In his third edition of Statistics in Psychology and Education, Garrett has 
made several noteworthy changes. Two new chapters entitled “Testing Experimen- 
tal Hypotheses” and “Multiple Correlation in Test Selection” (a description of 
the Wherry-Doolittle technique) are included. Among some of the other impor- 
tant additions are a discussion of small sample statistics, along with a limited 
consideration of analysis of variance, the substitution of the chi-square test for 
the somewhat outmoded Blakeman test for curvilihearity of regression, a highly 
abbreviated treatment of the method of “rational equivalence” in the determin- 
ation of a reliability coefficient, a unit concerning the multiple correlation 
coefficient in terms of the sum of the products of beta coefficients and respective 
zero-order correlation coefficients, and a section upon the accuracy of group pre- 
dictions based on an article by Taylor and Russell (The relationships of validity 
coefficients to the practical effectiveness of tests in selection: discussion and 
tables, Journal of Applied Psychology, 23, 1939, 565-578.) For the determination 
of the reliability of statistics, new tables of t at the five per cent level and one 
per cent level for different degrees of freedom are incorporated. 

As expected, the clarity of the exposition remains at a high level. Pedagog- 
ically, the text deserves highest commendation. Numerous illustrative examples 
in most sections serve to clinch the points the author attempts to make. An ample 
supply of problem exercises along with answers renders the text highly teachable. 
As in previous editions, no formal training in mathematics beyond intermediate 
algebra and elementary geometry is necessary to one’s comprehension of the sub- 
ject matter. Contrary to the practice of many writers in statistics, Garrett does 
not overestimate the mathematical background of his readers. 

Despite the many favorable features of the text, one does gain the impression 
that the author’s viewpoint is conservative—perhaps ten or twelve years behind 
current practices employed in much educational and psychological research. The 
highly mechanical discussion of analysis of variance is inadequate as evidenced 
by the fact that only eleven pages are devoted to this important topic. Little is 
said concerning the assumptions underlying the procedures employed, and noth- 
ing apparently is said about the limitations, or weaknesses, of the analysis-of- 
variance approach. Only the “one way” classification is treated. Even more 
disappointing is the omission of analysis of covariance. 

Further evidence of the author’s conservative outlook is the lack of any dis- 
cussion of factor analysis. Any attempt on the part of the student to consider 
test reliability and test validity in terms of factor pattern is therefore impossible. 
Recent work concerning the relationship of test reliability and/or test validity 
to the range of difficulty of items and their degree of intercorrelation receives 
no attention. Although the chapter on “The Reliability and Validity of Test 
Scores” is well written, it does not represent any marked changes over the mode 
of presentation in the Second Edition (1937). Surely, the recent experience gained 
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by psychologists during World War II in the areas of reliability and validity of 
tests offers ample opportunity for a broader consideration of these topics in the 
revision. 

In addition to a possible criticism of the relatively great emphasis placed up- 
on large sample statistics, attention ahould be called to the erroneous manner 
in which probability statements are employed concerning the fluctuation of pop- 
ulation parameters. Although the author frequently makes correct assertions 
concerning the probability that a statistic in random samples will vary within a 
certain interval above and below the value of a population parameter, he inter- 
polates, for example, such contradictory statements as, ‘“‘We may be confident 
(probability is .95) that the population ¢ is not larger than 2.61 nor smaller 
than 2.58 inches (2.57 + 1.96 X .02),” (p. 194, lines 26-28). The obtained sample 
standard deviation is actually 2.57. (Moreover, the distribution of sample sig- 
mas is not “normal.”) Other places where the same type of error occurs are: 
p. 185, lines 18-22; p. 185, lines 27-31; p. 195, lines 10-18; p. 203, lines 1-3; 
p. 207, lines 25-27; p. 219, lines 15-18 and p. 392, lines 29-81. Apparently, the 
author has taken, in these cited instances, statistic values as “best estimates” 
of population parameters without stating explicitly such a step. The inexperi- 
enced student is likely to be confused when Garrett shifts from a discussion of 
the range of values of an obtained sample statistic about a population parameter 
within certain fiducial limits to the erroneous approach just described. 

Another misleading point arises in the consideration of the standard error 
of the difference between two percentages (p. 220). Garrett has failed to pool 
the two-p values of both groups as his best estimate of the population value of 
p in testing the null hypothesis. 

Much needed are reasonably detailed, or at least adequate, definitions of 
such terms as degrees of freedom, t-distribution, fiducial limits, confidence in- 
tervals, null hypothesis, representative sampling, and biased sampling. For ex- 
ample, the definition of the null hypothesis is restricted to a zero difference in 
the instance of the difference between two means rather than extended to any 
prescribed difference which the experimenter wishes to test. In the treatment 
of skewness and kurtosis some reference to third and fourth moments might 
appear desirable. 

The reviewer was considerably disturbed at the misleading statement on 
page 224, “...and in small samples, wide deviations from the mean cannot appear 
if the sample is truly representative of a normally distributed group. When 
working with small samples, therefore, deviations far removed from the mean 
should be discarded much as a laboratory worker throws out measures of re- 
action time which are obviously premature or delayed.” Obviously, some of the 
wide deviations represent rare events which can be expected to occur at least 
once in several replications of an experiment. What could be potentially more 
undesirable than having the inexperienced student throw away certain data in 
order to make his observed results conform to values he had anticipated or de- 
sired! Garrett fails to point out any standard for the rejection of obtained data 
and to warn of hazards involved. 

In the next paragraph a somewhat less startling error—perhaps the term 
“slip” is appropriate—is the implication that two “independently drawn” samples 
with approximately the same means and the same standard deviations are repre- 
sentative of the same population. Obviously, the identity of the two respective 
statistic values in the two samples could be due to a common bias. 

Other minor slips include underestimation in the calculation of the range 
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(measure of ,variability) by one point, use of | =«| when =|z| is intended, and 
mislocation of the symbolic multiplication dot in products of numbers entering 
into two factorial expressions (p. 286): 10! and 5! 

Although there is sufficient basis for considerable criticism of the Third Ed- 
ition, it probably still remains one of the better texts in the field. Its lucid expon- 
itory style and its pleasing format will continue to appeal to many instructors 
and students. However, the student requiring a substantial background in 
quantitative methods will find it necessary to consult other standard texts com- 
monly used in courses open to advanced seniors and graduate students. Forth- 
coming revisions of the other texts in the field, as well as the appearance of new 
books with a somewhat more forward-looking point of view, will challenge the 
position of leadership and prestige that Statistics in Psychology and Education 
has held for so many years. 

WILLIAM B. MICHAEL 


Princeton University 


DOROTHY C. ADKINS with the assistance of ERNEST S. PRIMOFF; HAR- 
OLD L. McADOO; CLAUDE F. BRIDGES; and BERTRAM FORER. 
Construction and Analysis of Achievement Tests: The Development of Written 
and Performance Tests of Achievement for Predicting Job Performance of Pub- 
lic Personnel. Washington: Government Printing Office, 1947. On sale through 
the Office of the Superintendent of Documents. Pp. xvii + 292. $1.25. 


This book, originally conceived as a training manual for personnel of the 
United States Civil Service Commission, was written to provide “an understand- 
ing of how a test fits into a total examination, of how to plan test content or how 
to determine what items should be constructed, of how items behave in combina- 
tion with each other and with other parts of the examination, of how to meas- 
ure job success, and of how to tell whether items are, in fact, contributing to the 
prediction of job success.” These objectives are not conceived narrowly, and con- 
siderable space is devoted to certain elementary statistical procedures used in 
the construction and interpretation of tests. 

The first chapter, titled “Planning a Written Test,” stresses the need for 
preparing a test outline based on the pooled judgment of experts before item 
construction is begun. It is gratifying to find a discussion of this topic, which 
has been so often neglected. Although the authors are well aware of the fact 
that the contribution of an item to the variance of a weighted sum (such as 
the total score of a test) is dependent in part on the magnitude of the item 
variance, which is a function of item difficulty, this fact does not seem to have 
been related to the suggestion on page 29 that if “items in one area are so easy 
that they will be passed by about 80 per cent of the competitors, [the test con- 
structor] can arrange in an orderly fashion to counter-balance these by more 
difficult items in another area.” A word of caution might have been appropriate 
at that point, because the suggestion would result in balancing average item 
difficulty at the expense of reducing the relative effective weights of items in 
both areas in determining the total test score. Such deviations from the weights 
prescribed by the test outline could only lead to impairment in some degree of 
the test’s validity, as defined by the outline. 

A large number of other considerations in planning a test are also discussed, 
including some of the practical considerations involved in getting test technicians 
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and subject-matter experts to collaborate and in making job analysis of the type 
especially needed for tesi-construction purposes. 

In Chapter II the authors discuss the construction and compilation of writ- 
ten tests. Excellent suggestions for item writers are presented, though these do 
not by any means exhaust the topic. Apparently, the field is still open for a de- 
finitive treatment of the writing and editing of objective test items. The reviewer 
was happy to note the description of the item writer’s job as “essentially a 
highly creative task.” The authors are apparently aware of the costly statis- 
tical treatment often (and unfortunately) lavished on intrinsically shoddy items. 
The reviewer would have made more explicit than the authors have (on p. 41) 
the necessity for making sure that the keyed answer to a multiple-choice item 
is not only the best answer presented but is also the right answer, able to stand 
alone on its merits as precisely correct. The authors quite properly refer to the 
need for making each individual item as reliable as possible. They do not, how- 
ever, mention the fact, demonstrated by Horst many years ago,* that to maximize 
item reliability the incorrect choices in a multiple-choice item (where the correct 
answer is the most popular choice and the choices represent a graded series of 
steps of knowledge of the point being tested) should be made of equal attrac- 
tiveness to the subjects. 

Chapter Iil, titled “Basic Statistical Tools,” is, in the opinion of the reviewer, 
not quite up to the standards of the rest of the book. In fact, because it seems 
likely that many of the statements and concepts in this chapter will be subjected 
to critical review by statisticians, this review will merely describe the chapter as 
a whole and cite only a few examples of the inaccuracies and oversimplifications 
that will prove irritating to statisticians. 

The topics included in the chapter are all of value to anyone concerned 
with test construction or interpretation. Frequency distributions are discussed; 
measures of central tendency, variability, and correlation are described and eval- 
uated; facts concerning the normal probability curve are presented; and the 
standard errors of various statistical measurements are defined. In addition, 
there is a brief discussion of some problems of sampling. All of this material is 
clearly presented and is illustrated with some computational routines. The re- 
viewer’s objections arise from what seem to be avoidable oversimplifications. For 
example, the mean is specifically referred to (on pages 88, 90, and 91) as the 
most reliable measure of central tendency. And so it is in many circumstances. 
But not always.+ Analogous objection may be made to the following statement 
on page 95: “The standard deviation is the best measure of variability.” 

On page 130 the estimate of the standard error of a mean is defined as 


o 
——. This is a close approximatien, to be sure, but the reviewer sees no reason 


VN 


o 
for not presenting the equally simple unbiased estimate: —_———. This is par- 
VN—1 
ticularly true since the numerical illustration given by the authors will be found 
to yield 1.18 instead of 1.12 if VN — 1 is used in the denominator. (This change 
in the second decimal place is of no practical significance, of course, for the data 
at hand.) 


‘ - Horst, A. P. The difficulty of a multiple-choice test item. J. educ. Psychol., 
4, 229-232. 

; Vid., Kelley, T. L. Fundamentals of statistics. Cambridge: Harvard Univ. 
Press, 1947. Pp. 242-243. 
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The method of determining a confidence interval for the product-moment r 
that is presented on page 133 will probably annoy many statisticians who will be 
quick to point out that if the student cannot be expected to make use of David’s 
tables* he can at least be expected to make use of Fisher’s z transformation for 
solving problems of this type. In any case, a better estimate of the standard error 


1-—r 1—r 
of r than is , especially when N < 50, as in the illustrative 


VN vVN—2 
numerical problem given by the authors. 

The reviewer was sorry to note that the concept of statistical significance 
is presented on pages 182-183 in terms of rigid critical-ratio requirements. As 
a matter of fact, though the authors describe a critical ratio of 2-3 as not in- 
dicative of a significant difference but only “suggestive of a true difference,” 
statisticians rarely establish levels of confidence beyond the 1% level (which cor- 
responds to a critical ratio of 2.58 when both tails are considered). The reviewer 
suggests that Chapter III be compared with the presentation of similar content 
by Dr. Armen A. Alchian in Part III of Records, Anaylsis, and Test Procedures 
edited by Dr. Walter L. Deemer.** 

In chapter IV the authors have considered a number of problems and pro- 
cedures encountered in the analysis of test results. Types and uses of norms are 
discussed, techniques for determining test reliability and validity are compared 
and evaluated, and a method of item analysis is presented. Practical procedures 
used by the U. S. Civil Service Commission for transmuting test scores are de- 
scribed. A good many of the points discussed in this chapter are familiar ground to 
test technicians but have not heretofore been treated in references suitable for use 
by students of mental measurement. It is good to see them made accessible in this 
book. One criticism that might be made is, perhaps, that certain procedures are 
presented rather uncritically. It is interesting to know what technique of item 
analysis is used at the U. S. Civil Service Commission; it would be valuable to 
know what, if any, modifications in the technique could, on the basis of the ex- 
tensive experience of test-construction experts on the staff of the Commission, be 
expected to result in greater convenience and efficiency. In discussing reliability 
coefficients the authors state that the superiority of the Kuder-Richardson pro- 
cedures over other methods “is generally recognized.” While the reviewer would 
insist on the value and convenience of the various Kuder-Richardson formulas, 
he objects to so sweeping a generalization. When suitable validity coefficients are 
available for a test, the reliability coefficient is useful mainly for research pur- 
poses in revising the test or the battery of tests of which it is a part. The review- 
er believes that for research purposes, with the short, relatively homogeneous 
tests that will come more and more to be used in combination to measure a good 
many slightly correlated human traits, the product-moment correlation coefficient 
between two parallel forms carefully constructed to measure similar functions 
has a good deal to recommend it as a measure of reliability. Such a reliability co- 
efficient is easy to compute by the sums or differences formulas and, of great con- 
sequence for research purposes, it is subject to precise tests of significance. It is 
as economical to compute as is a Kuder-Richardson coefficient obtained by one of 
the more precise formulas and does not require assumptions as untenable as those 
demanded by the use of the simpler Kuder-Richardson formulas. 








* David, T. N., Tables of the ordinates and probability integral of the dis- 
tribution of r in small es en Cambridge: Cambridge University Press, 1988. 
** AAF Aviation Psychology Program Research Reports, No. 18, Washington: 


Government Printing Office, 1947. 
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An excellent feature of chapter IV is the emphasis on the relationship of test 
validity and test reliability. Very few psychologists realize how useful carefully 
constructed short (and therefore relatively unreliable) tests can be, especially 
when they are used in combination to yield weighted composite scores. The sooner 
we can get test constructors to realize the absurd inefficiency of using tests with 
reliability coefficients approximating .90 to .95 when their validity coefficients are 
of the order of .30 to .40, the quicker progress will be made in laying the basis 
for practical measures of value in educational and vocational guidance. 

The final chapter in the book presents a remarkably complete discussion of 
the problems encountered in developing performance tests. This chapter will be 
of special interest to psychologists in industry and in public-service agencies. 

Reading this book may make many people realize for the first time what a 
complicated and specialized task it is to construct examinations properly. It will 
be encovraging to laymen to realize that we have psychologists in Federal agen- 
cies, especially in the U. S. Civil Service Commission, who are abreast of develop- 
ments in psychometrics. The reviewer hopes that the technical staff of the U. S. 
Civil Service Commission, which has been cut to the point where there is actually 
no group of item writers and test constructors available to write the examinations 
that have to be constructed to select employees for the Federal service, will be 
built up to adequate size. Readers of Psychometrika should perhaps constitute a 
nucleus of support for providing sufficient personnel to permit the scientific 
preparation of civil service examinations. 


George Peabody College for Teachers FREDERICK B. DAVIS 


CARL E. SEASHORE. In Search of Beauty in Music. New York: The Ronald 
Press Company, 1947. Pp. xvi + 389. 


Each chapter of this book, according to the author’s preface, has appeared 
in whole or in part in some leading musical, scientific or educational journal. 
“It is an attempt to integrate my interpretive and popular articles on research 
in the psychology of music which have more or less direct bearing on the problem 
of esthetics.” It is directed to “advanced students of music and psychology, music 
teachers, educators, professional musicians and general readers interested in the 
scientific approach to the understanding and appreciation of beauty in music.” 

The psychologist will find it convenient to have collected into one volume 
many of the more important Iowa studies in music which have been available 
previously only in scattered monographs (and some of them now out of print), 
but it is the musician and the general reader who will probably find the kook most 
rewarding. The author declares that he has tried to present each topic “in readable 
and newsy style.” The frequent recapitulation of psychological principles and ex- 
perimental hypotheses “is carried for the purpose of a progressive establishment 
of relationships.” 

Readers of Psychometrika will find most useful and profitable a half dozen of 
the chapters in Part Two. These chapters describe the apparatus devised by Sea- 
shore and his students and colleagues in the Iowa laboratory for the study of tone 
quality, timbre, and sonance; for recording and analyzing the vibrato; for record- 
ing piano performance (the piano camera); and for portraying the spectra of 
various kinds of violin tone. The chapters on Aspects of Beauty in Song, which 
describes an experiment in portraying emotion by voice fluctuations, and Musical 
Anthropology, an account of the use of phonophotography in folk music, are also 
of interest. 
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For those interested in psychometrics, this volume does not carry a satis- 
factory account of the measurement of musical talents and of musical achieve- 
ment. The revision of the famous tests is described only in very general terms, 
with no mention of the statistical data to establish their validity and reliability. 
The arguments in support of them are recapitulated from a reply to Mursell’s 
criticisms in 1937. The measurement of musical achievement is likewise adapted 
from an article in The Music Educators Journal in 1940 and is of very general and 
non-technical nature. 

Dean Seashore makes no pretense of describing or incorporating the work of 
scholars outside his own laboratory — even of those who pioneered in some of 
the techniques later extended and perfected at Iowa. The work of Sorantin, Riggs, 
Heinlein, and others, and especially of Schoen in tone quality and mood effects, 
are not mentioned. Gundlach’s study of the primitive music of Indian tribes, 
Whipple’s early study of piano techniques through the player piano rolls, Moore’s 
and Farnsworth’s work in consonance, and findings of many other experimenters 
whose work has made history in this field are not found in these pages. In spite 
of the rather general title, the book is rightfully an account of the research pur- 
sued at the University of Iowa through a quarter century, together with some 
very readable and stimulating comment and discussion on them. 


Indiana University KATE HEVNER MUELLER 


GEORGE G. KILLINGER(Ed.). The Psychobiological Program of the War Ship- 
ping Administration. Applied Psychology Monographs, No. 12, 1947. Pp. 351. 


This monograph has been prepared by the personnel who participated in the 
Psychobiological Program of the War Shipping Administration to give a histori- 
cal resumé of their activities during the past war. It is necessary to note that 
this program was essentially a service program and was a part of the psychiatric 
activities of the Medical Director of the WSA. The chief functions of the psy- 
chologists were to aid the psychiatrists in the screening of emotionally unstable 
applicants and in psychotherapy. In addition, there fell to them certain other 
tasks such as the preparation and administration of courses in mental hygiene for 
purser-pharmacist’s mates, the psychological evaluation of officer candidates, and 
leadership training. 

The first major section of the monograph describes the Psychobiological Pro- 
grams at each of three U. S. Maritime Service Training Schools and two Cadet 
Schools. The second section is concerned with the effectiveness of the Maritime 
Service Inventory as an aid to psychiatric screening, and the prediction of sur- 
vival in cadet school. The third section is concerned with a description of the 
characteristics of various groups of Merchant Seamen. The fourth section reviews 
the mental hygiene activities. Two smaller sections are concerned with employ- 
ment and statistical records. Appendices present the items contained in the vari- 
ous forms of the Maritime Service Inventory, interview forms, and an outline of 
a course in human behavior and leadership. 3 

As an over-all report of the WSA program this monograph is most satisfac- 
tory. The scope and nature of the activities of the psychologists engaged in it are 
quickly and easily grasped. The reader is not burdened by involved statistical 
tables or by descriptions of details. The most complete analyses presented are 
relative to the Maritime Service Inventory in the prediction of psychiatric diag- 
nosis of emotional instability. To a considerable extent the lack of lengthy statis- 
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tical analyses is due to the fact that the primary function of the program was 
service and cljnical in nature. 

But this very lack of detailed analysis and description makes it a difficult 
task for the reviewer to evaluate the WSA program from this report. It is ap- 
parent that concentrated and systematic attacks were made on the major prob- 
lems which fell within the province of the psychologists. As well recognized by 
the participants in the program, better assessments of prospective seamen and 
cadets could have been made had the pressure of the situation been less. However, 
the reviewer cannot help being impressed with the quality and quantity of in- 
formation that was obtained on each candidate by means of short questionnaires 
and interviews. 


University of California EDWIN E. GHISELLI 


MUZAFER SHERIF and HADLEY CANTRIL. The Psychology of Ego-Involve- 
ments. New York: John Wiley and Sons, 1947. Pp. vi + 525. 


Defining ego as that constellation of attitudes which “define a person’s status 
or that give him some relative role with respect to other individuals, groups, or 
institutions” (p. 96), Sherif and Cantril review an enormous literature bearing 
on the formation of the ego in childhood and its constant modification as the in- 
dividual undergoes the socialization process. Rejecting on methodological grounds 
the frequently insightful but pat concepts of psychoanalysis, the authors of this 
volume formulate a dynamic point of view in which the ego is conceived as a 
thing in process, changing as the individual’s experience widens and breaking 
down under internal and external pressures ranging from organic pathology to 
unbearable economic pressures. 

Descriptively adequate, the volume seems to fail in its chief task, the ea- 
planation of ego-involvements. If the crucial nature of role-defining attitudes is 
granted, it becomes necessary to ask “how” and “why” questions about their ac- 
quisition. This requires some attention to the dynamics of learning. But Sherif 
and Cantril, though not unaware of the matter (p. 30), insist that they need not 
take sides in favor of any learning theory and consequently leave the problems 
most vital to ego psychology essentially neglected. 

In terms of what they have done, however, Sherif and Cantril have organized 
a treasure-house of ideas and facts into a firm descriptive structure, demon- 
strated a refreshing willingness to cut across academic boundary lines (see their 
illustrations from literature, pp. 437-460), and displayed a clarity of presenta- 
tion that is most gratifying. The volume setting forth an explanatory psychology 
of ego-involvements, however, is still to come. 


State University of Iowa EDWARD JOSEPH SHOBEN, JR. 
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COMMENTS 

As may be noted from the report, the Corporation took in $3,341.64 and paid 
out $1,765.49 during the fiscal year ending June 30, 1947, leaving a surplus of 
$1,576.15. When added to the previous accumulated surplus, this brought the June 
30 balance to $8,144.46. 

Business reached its all-time high in 1946-47. On August 31 the Society had 
251 paid members as compared to a previous maximum of 231. There were 190 
additional subscriptions, as compared to a previous high of 152. As a result, 
receipts were about $400 higher than they were last year, while disbursements 
were over $1,000.00 lower. 
_ There were not only enough new subscriptions (more than were received dur- 
ing the three previous years) to bring the total to its present all-time high; there 
were also more cancellations and deletions, more changes of address, and more 
back-issue orders than there were in the previous three years. As a result of this 
situation it was necessary to employ more clerical help than ever before, and at 
the end of the fiscal year to employ a professional accountant to audit the books, 
prepare the treasurer’s report, and revise the bookkeeping system. 
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Corn Exchange Bank Trust Co. - - - - - $ 719.34 
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COMMENTS 


As may be seen from this report, the Society took in $1,290.25 and paid out 
$1,159.66 during the fiscal year ending June 30, 1947, leaving a surplus of $130.59. 
308798. the accumulated surplus from previous years, this gave a balance of 

7.93. 

The Society’s membership reached an all-time high by August 31, 1947, with 
251 paid memberships (including those of the applicants who were voted into the 
Society at the September 10 meeting), twenty more than the previous year. 

Turnover figures also reached an all-time high. More members sent in 
changes of address this year than in the three previous years combined, and de- 
spite the net increase in membership, more members resigned or were dropped 
for non-payment of dues this year than in the last three years combined. 














